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1.  Introduction 

Prostate  cancer  is  one  of  the  most  prevalent  cancers  in  the  United  States,  where  there  are  more 
than  180,000  new  cases  diagnosed  each  year  and  1  in  7  men  develop  prostate  cancer  during  their 
lifetime  (7).  Over  26,000  men  die  annually  from  prostate  cancer,  accounting  for  8%  of  all  male 
cancer  deaths  (7).  Diagnosis  of  prostate  relies  on  histological  analysis  of  biopsies,  which  uses 
subjective  morphological  and  histological  criteria.  Only  neoplastic  changes  that  result  in  clear 
histological  defects  can  be  detected.  Moreover,  currently  there  is  a  lack  of  reliable  prognostic 
markers  to  distinguish  latent  prostate  cancers,  where  a  watch-and-wait  approach  is  the  best 
strategy  for  the  patient,  from  aggressive  tumors,  which  require  treatment  as  early  in  the 
cancers  life  as  possible.  During  the  DoD  Prostate  Cancer  Idea  Award  funding  period,  our 
objective  was  to  test  the  feasibility  of  using  the  differential  non-random  spatial  organization 
of  the  genome  between  normal  and  cancer  tissue  as  the  basis  of  a  novel,  molecularly  defined, 
diagnostic  and  prognostic  test.  Our  method  is  based  on  the  fact  that  genomes  are  non- 
randomly  organized  with  individual  chromosomes  and  genes  occupying  preferred  positions  within 
interphase  nuclei  (2-4).  Changes  to  these  positioning  patterns  can  occur  in  cancer  (5,  6).  For 
example,  in  pancreatic  cancer,  chromosome  8  shifts  to  a  more  peripheral  location  (7).  Similarly, 
chromosomes  18  and  19  change  nuclear  location  in  several  cancers  types,  including  cervical, 
thyroid  and  colon  (8,  9).  We  have  previously  discovered  several  genes  that  are  repositioned  in 
breast  cancer,  but,  importantly,  not  in  non-cancerous  breast  diseases  (10-12).  Such  changes  in 
spatial  positioning  patterns  are  not  a  reflection  of  a  global  reorganization  of  the  genome  during 
carcinogenesis  and  are  instead  loci-specific,  with  the  position  of  many  genomic  regions  remaining 
conserved  in  cancer  (7,  77,  12).  For  the  DoD  Prostate  Cancer  Idea  Award,  we  aimed  to  exploit 
spatial  genome  reorganization  events  for  the  diagnosis  and,  subsequently,  prognosis  of  prostate 
cancer. 

2.  Key  Words 

Prostate  Cancer;  Cancer  Diagnosis;  Spatial  Gene  Positioning;  Nuclear  Architecture;  Cancer 
Biomarkers,  Prognosis;  FLI1;  MMP9;  MMP2\  SP100. 

3.  Overall  Project  Summary 

Gene  positioning  methodology  (all  tasks) 

The  aim  of  this  project  was  to  identify  genes  that  occupy  distinct  intra-nuclear  positions  in  normal 
and  malignant  cells  and  to  explore  the  usability  of  these  markers  for  diagnostic  purposes.  The 
methods  are  described  in  full  in  the  accompanying  copies  of  the  published  articles  generated  from 
this  work  (10,  13)  (see  Appendices).  Briefly,  we  performed  fluorescence  in  situ  hybridization 
(FISH)  to  detect  individual  genes  in  4-5pm  thick  fonnalin  fixed,  paraffin  embedded  human 
prostate  tissue  sections,  using  protocols  established  for  breast  tissues  and  optimized  for  prostate 
tissues  (10,  11,  13,  14)  (Fig.  1.  A,  B).  The  radial  position  of  a  gene  was  determined  using  a 
previously  developed  image  analysis  method  (10, 11,  13).  Radial  positioning  describes  the  location 
of  a  gene  locus  along  the  axis  from  the  center  of  the  nucleus  to  the  nuclear  periphery  (edge).  To 
account  for  the  fact  that  nuclei  are  not  always  a  regular  elliptical  shape,  a  method  was  developed 
in  which  the  binary  Euclidean  distance  transform  (EDT)  was  computed  for  each  nucleus  (10,  11, 
13).  The  EDT  is  a  morphological  operation  that  assigns  each  pixel  in  a  nucleus  a  value  that  equals 
the  shortest  distance  to  the  edge  of  the  nucleus.  To  account  for  variations  in  nuclear  size,  the  EDT 
of  the  geometric  gravity  center  of  a  FISH  signal  is  nonnalized  to  the  maximum  EDT  value  for  the 
given  nucleus.  Using  this  method,  no  assumption  regarding  nuclear  shape  is  made  when 
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determining  the  radial  position  of  a  gene,  allowing  accurate  comparisons  between  tissues,  even  if 
there  are  differences  or  irregularities  in  nuclear  shape  or  size.  All  alleles  in  a  nucleus  were  included 
in  the  analysis  and  nuclei  were  included  regardless  of  the  number  of  alleles  present,  unless  no  gene 
signals  were  present  in  a  nucleus.  For  each  sample,  data  from  89-263  epithelial  nuclei,  which  were 
acquired  from  multiple  randomly  selected  regions  of  the  tissue  sample,  were  analyzed  and 
combined  to  determine  the  cumulative  relative  radial  distribution  (RRD)  for  each  gene  in  a  tissue. 
The  RRD  is  a  standard  measure  of  a  genes  position  in  a  population  and  is  defined  as  the  statistical 
distribution  of  the  radial  position  of  all  alleles  in  a  cell  population.  RRDs  were  statistically 
compared  to  each  other  using  the  two-sample  1-D  Kolmogorov-Smirnov  test  (KS;  P  <  0.01). 


Figure  1. 


C  |  Candidate  gene  | 

_ l _ 

Initial  screen:  Position  gene  in  a  min. 

3  normal  tissues  and  3  cancer  tissues 
_ t _ 

[Cross-compare  between  individual  tissues  and  to  PND| 

_ 1 _  I _ 

|  Not  repositioned  in  cancer  |  |  Potential  repositioning  in  cancerj 

_ I _ 

Increase  sample  size] 

t 

|Hyperplasic  tissues| 

_ 1 _ 

[Determine  false  positive  and  false  negative  rates  | 

Fig  1.  FISH  was  used  to  detect  gene  loci  in  FFPE  tissue  sections.  Blue,  DAPI  nuclear  counterstain.  Projected  image 
stacks  of  represented  FISH  are  shown.  (A)  HES5  (green)  in  a  normal  prostate  tissue.  Scale  bar,  5pm.  (B)  EGFR  (red) 
and  SPDEF  (green)  gene  loci,  in  a  malignant  prostate  tissue.  Bar,  10pm.  (C)  Outline  of  the  screening  strategy  used  to 
identify  genes  that  reposition  in  prostate  cancer.  PND:  Pooled  normal  distribution.  Fig  adapted  from  (10,  13) 

For  this  DoD  funded  study,  we  analyzed  the  RRDs  of  48  genes  (Table  1)  in  a  panel  of  human 
prostate  tissues  (Table  2).  We  have  previously  identified  a  set  of  eight  genes  ( AKT1 ,  CSF1R, 
ERBB2,  FOL2,  HES5,  FfSP90AAl,  MYC  and  TGFB3 )  that  reposition  in  breast  cancer  (11),  which 
we  refer  to  as  breast  GBPs  (Gene  Positioning  Biomarkers).  To  determine  if  breast  GPBs  would 
also  be  useful  markers  of  prostate  cancer  we  analyzed  the  positions  of  the  breast  GPBs  in  prostate 
tissues  (Published  (10),  appendix).  We  also  screened  a  wide  range  of  genes  in  prostate  tissues, 
which  included  many  of  the  most  important  genes  related  to  prostate  cancer,  such  as  genes  that 
have  been  reported  mis-regulated  in  prostate  cancer  or  participate  in  translocations,  as  well  as  a 
set  of  randomly  selected  genes,  which  were  selected  independently  of  known  changes  in  activity 
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or  involvement  in  prostate  cancer  development  or  progression.  Moreover,  the  test  set  included 
genes  from  21  different  chromosomes  (Published  (13),  appendix). 

Table  1.  Candidate  genes  positioned  in  human  prostate  tissues _ 


Gene 

Chromosomal 

BAC 

Number  of  nuclei 

location 

in  PND 

AKT1 

14q32.33 

RP1 1-47714 

465 

CSF1R 

5q32 

RP1 1-21120 

1397 

ERBB2 

17q21 

RP1 1-62N23 

515 

FOSL2 

2p23.2 

RP1 1-373D23 

1090 

HES5 

lp36.32 

RP3-395M20 

798 

HSP90AA1 

14q32.31 

RP11-367F11 

880 

MYC 

8q24.21 

RP1 1-237F24 

648 

TGFB3 

14q24.3 

RP1 1-270M14 

741 

AR 

Xql2 

RP1 1-479J1 

752 

BCL2 

1 8q2 1 .33 

RP1 1-299P2 

489 

BRCA2 

1 3ql  3 . 1 

RP11-37E23 

518 

CCND1 

1  lql3 

RP1 1-30016 

508 

DCN 

12q2 1 .33 

RP1 1-644B23 

461 

EGFR 

7pl  1.2 

RP1 1-815K24 

612 

ERG 

21q22.2 

RP 11-95121 

646 

ESR2 

14q23.2-q23.3 

RP1 1-14C21 

641 

ETV1 

7p21.2 

RP1 1-692L4 

569 

FGFR1 

8pll.23-pll.22 

RP1 1-100B16 

716 

FGFR2 

10q26.13 

RP1 1-300A10 

740 

FLI1 

1  lq24.3 

RP11-744N12 

1047 

FOXA1 

1 4q2 1 . 1 

RP11-606C5 

411 

FUT4 

1  lq21 

RP11-42H7 

449 

GREB1 

2p25.1 

RP11-50E1 

523 

HOXA9 

7pl5.2 

RP1 1-1 132K14 

505 

KLK3  (PSA) 

19ql3.33 

RP1 1-795B6 

541 

LMNA 

lq22 

RP1 1-35P22 

481 

MATR3 

5q31.2 

RP1 1-815G18 

488 

MMP1 

1  lq22.2 

RP1 1-686G6 

303 

MMP14 

14ql  1.2 

RP1 1-885L10 

324 

MMP2 

16ql2.2 

RP11-90H1 

791 

MMP9 

20ql3.12 

RP1 1-465L10 

1128 

NPM1 

5q35.1 

RP11-117L6 

311 

NUMA1 

1  lql3.4 

RP1 1-449G14 

514 

PADI4/6 

lp36. 1 3 

RP11-1119C2 

647 

PTEN 

10q23.31 

RP11-383D9 

391 

RAF1 

3p25.2 

RP11-148M13 

1027 

SATB1 

3p24.3 

RP1 1-1021 J5 

610 

SERPINB2 

1 8q2 1 .33 

RP1 1-75012 

380 

SLC45A3 

lq32.1 

RP11-6B6 

727 

SP100 

2q37.1 

RP11-727M18 

615 

SPDEF 

6p21.31 

RP11-375E1 

517 

TGFB1 

1 9ql  3 .2 

RP1 1-1012F23 

433 

THBS1 

15ql4 

RP1 1-590D2 

630 

TIMP2 

17q25.3 

RP11-72M9 

697 

TIMP3 

22ql2.3 

RP1 1-641L14 

760 

TMPRSS2 

21q22.3 

RP1 1-671L10 

1260 

VEGFA 

6p21.1 

RP1 1-1 152J4 

1375 

VIM 

1  Op  1 3 

RP1 1-1 12217 

631 

a)  Breast  cancer  GPBs  b)  remaining  genes.  BAC,  Bacterial  artificial  chromosome.  PND,  pooled  normal  distribution.  Table  from  (10,  13) 
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Table  2.  Characterization  of  human  prostate  tissues 
a.  Analysis  of  breast  cancer  GPBs  in  prostate  tissue 


Sample 

code 

Pathology 

Age 

(years) 

Vol  of 

cancer 

(cc) 

Weight  of 
prostate 
(g) 

Stage 

Gleason 

Score 

TNM 

Metastatic 

Source 

Source  specimen 

ID 

P-Cl 

Adenocarcinoma 

73 

II 

6  (3+3) 

T2N0M0 

No 

Biomax 

HuCAT37 1 

P-C2 

Adenocarcinoma 

L.D.T. 

UW7 

P-C3 

Adenocarcinoma 

L.D.T. 

UW8 

P-C4 

Adenocarcinoma 

75 

Biomax 

HuCAT376 

P-C5 

Adenocarcinoma 

72 

Biomax 

HuCAT367 

P-C6 

Adenocarcinoma 

80 

Biomax 

HuCAT366 

P-C7 

Adenocarcinoma 

L.D.T. 

UW3 

P-C8 

Adenocarcinoma 

58 

II 

6  (3+3) 

T2N0M0 

No 

Biomax 

TMA:  PR632, 
core  F3 

P-C9 

Adenocarcinoma 

73 

II 

5  (2+3) 

T2N0M0 

No 

Biomax 

TMA:  PR632, 
core  F5 

P-C10 

Adenocarcinoma 

L.D.T. 

UW24 

P-Cll 

Adenocarcinoma 

50’s 

5.5 

36 

7 

T3N0 

L.D.T. 

UW18 

P-C12 

Adenocarcinoma 

L.D.T. 

UW19 

P-C13 

Adenocarcinoma 

50's 

0.5 

30 

6 

T2N0 

L.D.T. 

UW5 

P-C14 

Adenocarcinoma 

60's 

1 

33 

7 

T2N0 

L.D.T. 

UW11 

P-C15 

Adenocarcinoma 

50's 

2 

40 

7 

T2N0 

L.D.T. 

UW13 

P-C16 

Adenocarcinoma 

50's 

3 

31 

6 

T2N0 

L.D.T. 

UW15 

P-C17 

Adenocarcinoma 

50's 

0.6 

26 

7 

T2N0 

L.D.T. 

UW9 

P-C18 

Adenocarcinoma 

50's 

4.5 

38 

7 

T2N0 

L.D.T. 

UW12 

P-C19 

Adenocarcinoma 

50's 

1.2 

32 

6 

T2N0 

L.D.T. 

UW14 

P-C20 

Adenocarcinoma 

40’s 

3.5 

37 

7 

T2N0 

L.D.T. 

UW16 

P-Bl 

Hyperplasia;  same 
individual  as  C3 

50's 

L.D.T. 

UW20 

TMA:  PR632, 

P-B2 

Hyperplasia 

73 

Biomax 

core  Dl,  D2,  D3 

# 

TMA:  PR632, 
core  El,  E2  # 

P-B3 

Hyperplasia 

74 

Biomax 

P-B4 

Hyperplasia 

78 

Biomax 

TMA:  PR632, 
cores  E7,  E8  # 

P-Nl 

Normal 

28 

Biomax 

HuFPT141 

P-N2 

Normal 

54 

Imgenex 

IMH-1035 

P-N3 

Normal 

35 

Biomax 

TMA:  PR632, 
core  B2 

7 


P-N4 

Normal 

37 

TMA:  PR632, 
core  B8 

P-N5 

Normal 

35 

Biomax 

TMA:  PR632, 
core  C4  or  C6 

P-N6 

Normal 

48/35a 

Biomax 

HuFPT143 

P-N7 

Normal 

64 

BioChain 

T2234201 

P-N8 

Normal 

43 

Biomax 

HuFPT 1 42 

P-N9 

Normal 

L.D.T. 

UW21 

P-N10 

Normal 

L.D.T. 

UW22 

P-Nll 

Normal 

L.D.T. 

UW24 

P-N12 

NAT  (of  P-C2) 

L.D.T. 

UW7 

P-N13 

NAT  (of  P-C3) 

L.D.T. 

UW8 

P-N14 

NAT  (ofP-CIO) 

L.D.T. 

UW17 

P-N15 

NAT  (ofP-Cll) 

50s 

L.D.T. 

UW18 

P-N16 

NAT  (of  P-C12) 

L.D.T. 

UW19 

P-N17 

NAT  (of  P-C13) 

50's 

L.D.T. 

UW5 

P-N18 

NAT  (of  P-C14) 

60's 

L.D.T. 

UW11 

P-N19 

NAT  (of  P-C15) 

50's 

L.D.T. 

UW13 

P-N20 

NAT  (of  P-C16) 

50's 

L.D.T. 

UW15 

P-N21 

NAT  (of  P-C18) 

50's 

L.D.T. 

UW12 

P-N22 

NAT  (of  P-C19) 

50's 

L.D.T. 

UW14 

P-N23 

NAT  (of  P-C20) 

40’s 

L.D.T. 

UW16 

P-N24 

NAT  (of  P-  C17) 

50's 

L.D.T. 

UW9 

b)  Analysis  of  other  genes 


Sample 

code 

Pathology 

Age 

(years) 

Vol  of 

cancer 

(cc) 

Weight  of 

prostate 

(g) 

Stage 

Gleason 

Score 

TNM 

Metastatic? 

Source 

Source  specimen  ID 

Cl 

Adenocarcinoma 

40's 

1 

36 

7 

T2N0 

L.D.T. 

UW1 

C2 

Adenocarcinoma 

60's 

1.6 

47 

9 

T2N0 

L.D.T. 

UW2 

C3 

Adenocarcinoma 

L.D.T. 

UW3 

C4 

Adenocarcinoma 

60's 

0.6 

40 

7 

T2N0 

L.D.T. 

UW4 

C5 

Adenocarcinoma 

50's 

0.5 

30 

6 

T2N0 

L.D.T. 

UW5 

C6 

Adenocarcinoma 

73 

II 

6  (3+3) 

T2N0M0 

No 

Biomax 

HuCAT371 

Cl 

Adenocarcinoma 

58 

II 

6  (3+3) 

T2N0M0 

No 

Biomax 

TMA:  PR632,  core  F3 

C8 

Adenocarcinoma 

73 

II 

5  (2+3) 

T2N0M0 

No 

Biomax 

TMA:  PR632,  core  F5 

C9 

Adenocarcinoma 

62 

II 

7  (3+4) 

T2N0M0 

No 

Biomax 

TMA:  PR632,  core  G2 

CIO 

Adenocarcinoma 

71 

II 

8  (4+4) 

T2N0M0 

No 

Biomax 

TMA:  PR632,  core  G8 

Cll 

Adenocarcinoma 

62 

IV 

7  (3+4) 

T3NlMlb 

Yes 

Biomax 

TMA:  T196,  core  A6 

C12 

Adenocarcinoma 

69 

II 

6  (3+3) 

T2N0M0 

No 

Biomax 

TMA:  T196,  core  B6 

8 


C13 

Adenocarcinoma 

60 

IV 

8  (3+5) 

T3NlMlb 

Yes 

Biomax 

TMA:  T196,  core  D2 

C14 

Adenocarcinoma 

64 

IV 

7  (3+4) 

T3N0Mlb 

Yes 

Biomax 

TMA:  T196,  core  D4 

C15 

Adenocarcinoma 

60's 

3.5 

36 

7 

T3bN0 

L.D.T. 

UW6 

C16 

Adenocarcinoma 

L.D.T. 

UW7 

C17 

Adenocarcinoma 

L.D.T. 

UW8 

C18 

Adenocarcinoma 

50's 

0.6 

26 

7 

T2N0 

L.D.T. 

UW9 

C19 

Adenocarcinoma 

60's 

2.5 

30 

7 

T3N0 

L.D.T. 

UW10 

C20 

Adenocarcinoma 

60's 

1 

33 

7 

T2N0 

L.D.T. 

UW11 

C21 

Adenocarcinoma 

50's 

4.5 

38 

7 

T2N0 

L.D.T. 

UW12 

C22 

Adenocarcinoma 

50's 

2 

40 

7 

T2N0 

L.D.T. 

UW13 

C23 

Adenocarcinoma 

50's 

1.2 

32 

6 

T2N0 

L.D.T. 

UW14 

C24 

Adenocarcinoma 

50's 

3 

31 

6 

T2N0 

L.D.T. 

UW15 

C25 

Adenocarcinoma 

40’s 

3.5 

37 

7 

T2N0 

L.D.T. 

UW16 

C26 

Adenocarcinoma 

L.D.T. 

UW17 

C27 

Adenocarcinoma 

75 

Biomax 

HuCAT376 

C28 

Adenocarcinoma 

50’s 

5.5 

36 

7 

T3N0 

L.D.T. 

UW18 

C29 

Adenocarcinoma 

72 

Biomax 

HuCAT367 

C30 

Adenocarcinoma 

L.D.T. 

UW19 

B1 

Hyperplasia;  same 
individual  as  C3 

50's 

L.D.T. 

UW20 

B2 

Hyperplasia 

70 

Biomax 

TMA:  PR632,  core  D6 

B3 

Hyperplasia 

76 

Biomax 

TMA:  PR632,  cores 
D9,  D8,  D7  # 

B4 

Hyperplasia 

78 

Biomax 

TMA:  PR632,  cores 
E7,  E8  # 

B5 

Hyperplasia 

74 

Biomax 

TMA:  PR632,  core  E2 

N1 

Normal 

28 

Biomax 

TMA:  PR632,  core  A9 

N2 

Normal 

35 

Biomax 

TMA:  PR632,  core  B2 

N3 

Normal 

35 

Biomax 

TMA:  PR632,  core  C6 

N4 

Normal 

38 

Biomax 

TMA:  PR632,  core  C7 

N5 

Normal 

28 

Biomax 

HuFPT 1 4 1 

N6 

Normal 

43/3  la 

Biomax 

HuFPT142 

N7 

Normal 

48/35a 

Biomax 

HuFPT 143 

N8 

Normal 

L.D.T. 

UW21 

N9 

Normal 

L.D.T. 

UW22 

N10 

Normal 

L.D.T. 

UW23 

Nil 

Normal 

54 

Imgenex 

IMH-1035 

N12 

NAT  (of  Cl) 

40's 

L.D.T. 

UW1 

N13 

NAT  (of  C2) 

60's 

L.D.T. 

UW2 

N14 

NAT  (of  C4) 

60's 

L.D.T. 

UW4 

9 


N15 

NAT  (of  C5) 

50's 

L.D.T. 

UW5 

N16 

NAT  (of  C15) 

60's 

L.D.T. 

UW6 

N17 

NAT  (of  Cl 6) 

L.D.T. 

UW7 

N18 

NAT  (of  Cl 7) 

L.D.T. 

UW8 

N19 

NAT  (of  Cl 8) 

50's 

L.D.T. 

UW9 

N20 

NAT  (of  Cl 9) 

60's 

L.D.T. 

UW10 

N21 

NAT  (of  C20) 

60's 

L.D.T. 

UW11 

N22 

NAT  (of  C21) 

50's 

L.D.T. 

UW12 

N23 

NAT  (of  C22) 

50's 

L.D.T. 

UW13 

N24 

NAT  (of  C23) 

50's 

L.D.T. 

UW14 

N25 

NAT  (of  C24) 

50's 

L.D.T. 

UW15 

N26 

NAT  (of  C25) 

40’s 

L.D.T. 

UW16 

N27 

NAT  (of  C26) 

L.D.T. 

UW17 

N28 

NAT  (of  C28) 

50s 

L.D.T. 

UW18 

N29 

NAT  (of  C30) 

L.D.T. 

UW19 

Biomax,  US  Biomax  Inc;  Imgenex,  Imgenex  corporation;  L.D.T.,  Dr.  Lawrence  True;  NAT,  Normal  Adjacent  to  Tumor;  Normal,  Normal  tissue  taken  from  cancer 
free  prostates;  TMA,  Tissue  microarray;  Vol,  volume;  #,  Multiple  cores  of  the  same  tissue  were  used  for  analysis;  “The  individual  used  for  these  two  catalogue 
numbers  changed  with  subsequent  purchases,  consequently  it  is  not  always  tissue  from  the  same  individual  used  between  the  different  genes  for  these  sample  codes. 
Tables  from  (10,  13) 
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We  used  the  same  approach  that  led  to  the  identification  of  positioning  marker  genes  for  breast 
cancer  to  search  for  gene  positioning  biomarkers  (GPBs)  for  prostate  cancer  (Fig.  1  C).  For  an 
initial  screen  of  candidate  genes,  we  assessed  the  position  of  each  gene  in  3-6  cancer  and  3-6 
nonnal  tissues.  In  our  work  on  breast  cancer,  we  established  that  this  sample  size  accurately 
predicts  if  the  given  gene  has  promise  to  be  a  cancer  GPB  (11).  We  focused  on  aggressive  cancers 
for  this  initial  screen,  with  the  majority  of  cancers  used  being  Gleason  score  >7.  We  did  this  to 
bias  our  results  to  find  markers  of  aggressive  cancers,  the  most  clinically  relevant  subtype  of 
prostate  cancer,  since  the  best  marker  currently  available  for  the  aggressiveness  of  a  prostate  cancer 
is  its  Gleason  score,  with  Gleason  score  >7  is  considered  an  aggressive  cancer.  Promising 
candidate  genes  were  further  characterized  in  a  larger  cancer  tissue  set,  which  included  cancers 
with  Gleason  scores  of  5  and  6.  This  will  enable  us  to  assess  if  the  genes  position  can  be  used  as  a 
marker  of  prostate  cancer  in  general  or  specifically  of  aggressive  cancer. 

Taskl:  Identification  of  candidate  makers  based  in  radial  positioning. 

Tasks  4  and  5:  Validation  in  normal  and  cancerous  tissues. 


Conservation  of  positioning  patterns  in  morphologically  normal  prostate  tissues 

It  is  currently  unknown  how  similar  positioning  patterns  are  between  prostate  tissues  from  different 
individuals.  Potential  variability  of  positioning  patterns  is  particularly  relevant  in  prostate,  since 
the  position  of  several  genes  has  been  shown  to  be  sensitive  to  androgen  (15,  16).  Genes  that  are 
highly  variable  between  individuals,  in  non-diseased  tissues,  will  not  be  useful  as  cancer 
bio  markers  since  they  are  likely  to  generate  high  false  positive  rates.  To  detennine  the  degree  of 
variability  in  gene  localization  amongst  individuals,  we  compared  the  positioning  pattern  of  our 
set  of  48  genes  in  morphologically  normal  prostate  tissues  (10,  13).  For  most  of  the  genes  analyzed 
the  radial  distribution  of  the  gene  between  normal  prostate  tissues  was  similar  (Table  3,  4,  Fig.  2, 
3,4,  (10,  13).  For  12  out  of  48  (25.0%)  genes  the  RRDs  were  statistically  identical  between  all 
individuals  (P  >  0.01;  Table  3,  Fig.  2,  3,4).  A  further  25  genes  (52.1%),  although  not  identical 
between  all  comparisons,  were  very  similarly  positioned  between  most  individuals,  with  less  than 
33.4%  of  cross-comparisons  being  statistically  significantly  different  (P  <  0.01).  The  position  of 
22.9%  of  genes  (11/44;  FOL2,  CSF1R,  HSP90AA1,  CCND1,  ERG,  LMNA,  TGFB1,  TIMP2, 
TIMP3,  TMPRSS2  and  VIM)  were  highly  variable  between  individuals,  with  38.9-70%  of  the 
individual  cross-comparison  between  nonnal  tissues  being  significantly  different  FOL2,  CSF1R 
and  HSP90AA1)  (10,  13). 

As  another  measure  of  conservation  in  positioning  patterns  between  individuals,  we  compared  the 
RRD  of  each  gene  in  each  normal  tissue  to  a  pooled  normal  distribution  (PND),  generated  by 
combining  the  RRDs  from  all  nonnal  tissues  analyzed  into  a  single  distribution  for  each  gene,  as 
previously  described  (11).  Again  we  found  a  high  level  of  similarity  between  individuals  (Table 
4,  Figs.  2-4)  (10,  13).  For  30  of  the  48  (62.5%)  genes,  no  individual’s  normal  tissue  was 
significantly  different  than  the  PND  (Table  4,  Figs.  2-4)  (10,  13).  A  further  nine  (18.8%)  genes 
had  only  a  single  normal  tissue  significantly  different  to  the  PND  (7.7-25%  of  normal  tissues); 
while  five  genes,  CSF1R,  FOSL2,  FGFR1,  TMPRSS2  and  RAF  I  had  -one  third  (27.3-37.5%)  of 
nonnal  tissues  significantly  different  to  the  PND  (Table  4,  Figs.  2-4)  (10,  13).  Four  genes 
(HSP90AA1,  CCND1,  TIMP2  and  TIMP3)  (4/48;  8.3%),  however  should  substation  intra¬ 
individual  variation,  and  were  significantly  different  to  the  PND  in  42.9-66.7%  of  normal  tissues 
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(Table  4,  Figs.  2-4)  (10,  13).  Taken  together,  these  data  demonstrate  that  the  radial  positions  of 
most  genes  are  conserved  amongst  prostate  tissue  from  different  individuals.  However,  the 
position  of  a  subset  of  genes  is  variable  between  individuals. 


)  JO  h-  «  o>  o 

:  z  z  z  z  z 

.  CL  CL  CL  CL  0L 


FOSL2 


SOr  Pin 
T-  T-  CM  CM  CM 

z  z  z  z  z 


HES5 

n  in  s  »  a  o 

o  o  o  o  o  o 

Q.  CL  CL  CL  Q.  CL 


HSP90AA1 


ZZZZZZ  2  OOOO 
CL  CL  Q.  Q.  CL  Ql  0.  CL  CL  CL  Q. 


MYC 

to  co  a  'j  <r  m  n 

ZZZZZ  z  OOOOi 
CL  CL  CL  d-  CL  CL  CL  CL  CL  CL  O 


TGFB3 


zzzzz  29999 

CL  CL  cL  CL  CL  0.  CL  O.  CL  CL 


>  I"  '“¥1 

■■  ■  H^:  I 


P-N1 

P-N2 

P-N7 

P-N12 

P-N13 

P-N15 

PND 


■ 


Key: 

P>  0.05 
P<  0.05 
P<0.01 
P<  0.001 
P<  0.0001 


Fig-2.  Breast  cancer  GPBs  do  not  reposition  in  prostate  cancer,  (a)  Cumulative  RRDs  for  the  indicated  genes  in 
prostate  cancer,  normal  tissues  and  the  pooled  normal  distribution  (PND),  (b)  Heat  maps  representing  the  pair-wise 
statistical  comparisons  of  positioning  patterns  of  indicated  genes  between  tissues,  using  the  two-sample  ID  KS  test. 
P-Nl  -  P-N24,  normal  prostate  tissues;  P-Cl  -  P-C20,  cancerous  prostate  tissues  Black  or  white  asterisks  indicate  a 
cross-comparison  between  a  normal  and  cancer  specimen  from  the  same  individual.  Fig  adapted  from  (10). 
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Table  3.  Cross-comparisons  between  individual  tissues  for  breast  cancer  GPB  genes 


Number  of  tissues 

%  (and  number)  of  SD  cross-comparison  between: 

Gene 

Normal 

Cancer 

Individual  normal 

Individual  normal  and 

tissues 

cancer  tissues 

4KT1 

4 

6 

16.7%  (1/6) 

37.5%  (9/24) 

CSF1R 

11 

9 

41.8%  (23/55) 

38.4%  (38/99) 

ERBB2 

4 

5 

33.3%  (2/6) 

35.0%  (7/20) 

FOSL2 

9 

7 

38.9%  (14/36) 

47.6%  (30/63) 

HES5 

6 

6 

20.0%  (3/15) 

27.8%  (10/36) 

HSP90AA1 

7 

4 

57.1%  (12/21) 

46.4%  (13/28) 

MYC 

6 

5 

6.7%  (1/15) 

3.3%  (1/30) 

TGFB3 

6 

4 

33.3%  (5/15) 

25.0%  (6/24) 

4R 

7 

6 

9.5%  (2/21) 

16.7%  (7/42) 

BCL2 

4 

4 

0.0%  (0/6) 

0.0%  (0/16) 

BRCA2 

4 

3 

0.0%  (0/6) 

0.0%  (0/12) 

CCND1 

4 

4 

50.0%  (3/6) 

37.5%  (6/16) 

DCN 

5 

3 

20.0%  (2/10) 

0.0%  (0/15) 

EGFR 

6 

4 

0.0%  (0/15) 

0.0%  (0/24) 

ERG 

5 

3 

50.0%  (5/10) 

40.0%  (6/15) 

ESR2 

5 

4 

0.0%  (0/10) 

0.0%  (0/20) 

ETV1 

5 

3 

0.0%  (0/10) 

0.0%  (0/15) 

FGFR1 

7 

5 

28.6%  (6/21) 

22.9%  (8/35) 

FGFR2 

6 

4 

33.3%  (5/15) 

41.7%  (10/24) 

FLU 

8 

14 

7.1%  (2/28) 

60.7%  (68/112) 

FOXA1 

4 

4 

0.0%  (0/6) 

0.0%  (0/16) 

FUT4 

4 

3 

0.0%  (0/6) 

0.0%  (0/12) 

GREB1 

4 

3 

16.7%  (1/6) 

25.0%  (3/12) 

HOXA9 

4 

3 

0.0%  (0/6) 

8.3%  (1/12) 

KLK3 

5 

4 

20.0%  (2/10) 

30%  (6/20) 

LMNA 

4 

3 

50.0%  (3/6) 

16.7%  (2/12) 

MATR3 

4 

3 

33.3%  (2/6) 

0.0%  (0/12) 

MMP1 

3 

3 

33.3%  (1/3) 

11.1%  (1/9) 

MMP14 

3 

3 

33.3%  (1/3) 

44.4%  (4/9) 

MMP2 

8 

16 

7.1%  (2/28) 

48.4%  (62/128) 

MMP9 

10 

19 

17.8%  (8/45) 

44.2%  (84/190) 

NPM1 

3 

3 

33.3%  (1/3) 

33.3%  (3/9) 

NUMA1 

4 

4 

0.0%  (0/6) 

12.5%  (2/16) 

PAD14/6 

6 

6 

13.3%  (2/15) 

5.6%  (2/36) 

PTEN 

3 

3 

0.0%  (0/3) 

22.2%  (2/9) 

RAF1 

8 

5 

32.1%  (9/28) 

30.0%  (12/40) 

SATB1 

5 

4 

20.0%  (2/10) 

15.0%  (3/20) 

SERPINB2 

4 

4 

16.7%  (1/6) 

6.3%  (1/16) 

SLC45A3 

5 

0 

0.0%  (0/10) 

ND 

SP100 

5 

4 

0.0%  (0/10) 

20.0%  (4/20) 

SPDEF 

5 

4 

10.0%  (1/10) 

5.0%  (1/20) 

TGFB1 

4 

4 

50.0%  (3/6) 

30.0%  (5/16) 

THBS1 

5 

4 

20.0%  (2/10) 

30.0%  (6/20) 

T1MP2 

6 

7 

53.3%  (8/15) 

33.3%  (14/42) 

TIMP3 

6 

4 

60.0%  (9/15) 

58.3%  (14/24) 

TMPRSS2 

6 

4 

60.0%  (9/15) 

50.0%  (12/24) 

VEGFA 

13 

7 

24.4%  (19/78) 

47.3%  (43/91) 

VIM 

5 

4 

70.0%  (7/10) 

40.0%  (8/20) 

a)  Breast  cancer  GPBs  b)  remaining  genes. SD,  significantly  different,  based  on  a  two-sample  ID  KS  test,  P  <  0.01.  Tables  from 
(10,  13) 
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Identification  of  repositioned  genes  in  prostate  cancer 

Having  established  limited  variability  in  gene  positioning  amongst  individuals,  we  screened  for 
genes  that  reposition  in  prostate  cancer.  To  this  end,  we  compared  the  RRDs  of  47  genes  in 
multiple  cancer  tissue  to  individual  nonnal  tissues  and  to  the  PND  (Tables  3,4,  Figures  2-4)  (10, 
13).  First,  we  determined  how  many  of  the  previously  identified  breast  GPBs  (AKT1,  CSF1R, 
ERBB2,  FOL2,  HES5,  HSP90AA1,  MYC  and  TGFB3  (11))  reposition  in  prostate  cancer,  to  assess 
if  they  would  also  be  useful  markers  of  prostate  cancer.  We  find  that  none  of  these  breast  GPBs 
robustly  reposition  in  prostate  cancer  tissue  (Tables  3a,  4a,  Fig.  2)  (10).  The  majority  of  cross¬ 
comparisons  between  normal  and  prostate  cancer  tissues  were  not  significantly  different  for  any 
of  the  genes  with  between  3.3%-47.6%  cross-comparisons  being  significantly  different.  For  most 
genes  the  number  of  significantly  different  cross-comparisons  between  normal  and  malignant 
prostate  tissues  was  similar  or  less  than  the  proportion  ( 1 6.7%-57. 1  %)  of  cross-comparisons 
between  normal  tissues  that  were  significantly  different,  suggesting  that  the  repositioning  was  not 
cancer  specific  but  rather  related  to  the  variability  in  positioning  patterns  between  individuals 
(Table  3a,  Fig.  2)  (10).  Comparing  the  position  of  genes  in  prostate  cancer  specimens  to  their  PND 
also  identified  only  a  limited  amount  of  repositioning  of  these  genes  in  prostate  cancer  (Table  4a, 
Fig.  2)  (10).  MYC,  TGFB3  and  HES5  repositioned  in  0-33.3%  prostate  adenocarcinoma’s 
compared  to  their  PNDs  and  ERBB2,  FOL2,  CSF1R,  AKT1  and  TGFB3  in  40.0-50.0%  of  cancer 
tissues  compared  to  their  PND  (Table  4a,  Fig.  2)  (10). 

Next,  we  positioned  our  remaining  39  genes  in  cancerous  prostate  tissue  to  identify  prostate  cancer 
biomarkers  (13).  While  the  position  of  nine  genes  (23.1%)  were  indistinguishable  in  all  cross¬ 
comparisons  between  individual  normal  and  cancer  tissues  and  an  additional  19  genes  (48.7%) 
had  only  limited  variability  between  nonnal  and  cancer  tissues,  with  5.0-33.4%  of  cross¬ 
comparisons  reaching  significance,  nine  genes  (23.1%;  CCND1,  ERG,  VIM,  FGFR2,  MMP9, 
MMP14,  VEGFA,  MMP2  and  TMPRSS2)  were  differently  positioned  in  33.4-50.0%  comparisons. 
Two  genes  (5.1%;  TIMP3  and  FL11)  were  repositioned  in  -60%  of  cross-comparisons  between 
nonnal  and  malignant  prostate  tissues  (Table  3b,  Figs.  3,4)  (13).  For  most  genes  the  number  of 
significantly  different  cross-comparisons  between  nonnal  and  cancer  specimens  was  not  above 
that  of  the  cross-comparisons  between  normal  tissues,  suggesting  that  the  repositioning  was  not 
cancer  specific  (Table  3b)  (13).  For  example,  while  TIMP3  was  differently  positioned  in  58.3% 
(14/24)  of  the  cross-comparisons  between  individual  normal  and  cancer  tissues,  60.0%  (9/15)  of 
cross-comparisons  were  statistically  different  when  nonnal  tissues  were  cross-compared  to  each 
other.  Taking  into  consideration  the  degree  of  positioning  changes  in  all  cross-comparisons  and 
the  inter-individual  variability  of  candidates,  we  identified  three  genes,  MMP2,  MMP9  and  FLI1, 
which  exhibited  robust  differential  positioning  between  nonnal  and  cancer  tissues.  For  these  genes, 
the  percentage  of  cross-comparisons  between  normal  and  cancer  tissues  were  higher  by  >  25% 
compared  to  when  normal  tissues  were  compared  to  each  other  (48.4%  vs  7.1%;  44.2%  vs  17.8% 
and  60.7%  vs  7.1%,  respectively;  Table  3b,  Figs.  3,4)  (13). 
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Fig.  3.  Gene  specific  spatial  reorganization  of  the  genome  in  prostate  cancer.  Gene  loci  were  detected  by  FISH  in 
FFPE  prostate  tissue  sections.  Blue,  DAPI  nuclear  counterstain.  Projected  image  stacks  are  shown.  (A)  FLI1,  MMP9 
and  MMP2  gene  loci  (red)  in  normal  and  cancer  tissues.  Bar,  5pm.  (B)  Cumulative  RRDs  for  the  indicated  genes  in 
prostate  cancer  (red),  normal  tissues  (black)  and  the  pooled  normal  distribution  (PND,  blue).  The  positioning  patterns 
of  some,  but  not  all,  genes  are  different  in  prostate  cancer  compared  to  normal  tissues.  RRP,  relative  radial  position. 
Figure  adapted  from  (75). 
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Fig  4.  Gene  specific  repositioning  in  prostate  cancer.  Heat  maps  representing  the  pair-wise  statistical  comparisons  of 
positioning  patterns  of  indicated  genes  between  tissues,  using  the  two-sample  ID  KS  test.  While  positioning  patterns 
are  statistically  similar  (blue,  brown)  between  most  normal  tissues  (N1-N28),  they  can  be  divergent  (red,  orange, 
yellow)  in  cancer  tissues  (C1-C24).  Black  or  white  asterisks  indicate  a  cross-comparison  between  a  normal  and  cancer 
specimen  from  the  same  individual.  PND,  pooled  normal  distribution.  Fig.  adapted  from  (73). 
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Table  4.  Comparison  of  individual  tissues  to  a  pooled  normal 

a) _ 


%  (and  number)  of  SD  cross-comparison  between: 

Gene 

Individual  normal  tissues 

Individual  cancer  tissues  and 

and  pooled  normal 

pooled  normal 

AKT1 

0.0%  (0/4) 

50.0%  (3/6) 

CSF1R 

27.3%  (3/11) 

44.4%  (4/9) 

ERBB2 

0.0%  (0/4) 

40.0%  (2/5) 

FOSL2 

33.3%  (3/9) 

42.9%  (3/7) 

HES5 

0.0%  (0/6) 

33.3%  (2/6) 

HSP90AA1 

42.9%  (3/7) 

50.0%  (2/4) 

MYC 

0.0%  (0/6) 

0.0%  (0/5) 

TCFB3 

16.7%  (1/6) 

25.0%  (1/4) 

b) 

%  (and  number)  of  SD  cross-comparison  between: 

Gene 

Individual  normal  tissues 

Individual  cancer  tissues  and 

and  pooled  normal 

pooled  normal 

AR 

0.0%  (0/7) 

33.3%  (2/6) 

BCL2 

0.0%  (0/4) 

0.0%  (0/4) 

BRCA2 

0.0%  (0/4) 

33.3%  (1/3) 

CCND1 

50.0%  (2/4) 

25.0%  (1/4) 

DCN 

0.0%  (0/5) 

0.0%  (0/3) 

EGFR 

0.0%  (0/6) 

0.0%  (0/4) 

ERG 

20.0%  (1/5) 

33.3%  (1/3) 

ESR2 

0.0%  (0/5) 

0.0%  (0/4) 

ETV1 

0.0%  (0/5) 

0.0%  (0/3) 

FGFR1 

28.6%  (2/7) 

40.0%  (2/5) 

FGFR2 

16.7%  (1/6) 

50.0%  (2/4) 

FLU 

12.5%  (1/8) 

92.9(13/14) 

FOXA1 

0.0%  (0/4) 

0.0%  (0/4) 

FUT4 

0.0%  (0/4) 

0.0%  (0/3) 

GREB1 

25.0%  (1/4) 

0.0%  (0/3) 

HOXA9 

0.0%  (0/4) 

33.3%  (1/3) 

KLK3 

20.0%  (1/5) 

25.0%  (1/4) 

LMNA 

0.0%  (0/4) 

33.3%  (1/3) 

MATR3 

0.0%  (0/4) 

0.0%  (0/3) 

MMP1 

0.0%  (0/3) 

0.0%  (0/3) 

MMP14 

0.0%  (0/3) 

66.7%  (2/3) 

MMP2 

0.0%  (0/8) 

56.3%  (9/16) 

MMP9 

10.0%  (1/10) 

68.4%  (13/19) 

NPM1 

0.0%  (0/3) 

66.7%  (2/3) 

NUMA1 

0.0%  (0/4) 

50.0%  (2/4) 

PADI4/6 

0.0%  (0/6) 

16.7%  (1/6) 

PTEN 

0.0%  (0/3) 

0.0%  (0/3) 

RAF1 

37.5%  (3/8) 

40.0%  (2/5) 

SATB1 

0.0%  (0/5) 

25.0%  (1/4) 

SERP1NB2 

0.0%  (0/4) 

0.0%  (0/4) 

SLC45A3 

0.0%  (0/5) 

ND 

SP100 

0.0%  (0/5) 

25.0%  (1/4) 

SPDEF 

0.0%  (0/5) 

0.0%  (0/4) 

TGFB1 

25.0%  (1/4) 

0.0%  (0/4) 

THBS1 

0.0%  (0/5) 

50.0%  (2/4) 

TIMP2 

50.0%  (3/6) 

28.6%  (2/7) 

TIMP3 

66.7%  (4/6) 

50.0%  (2/4) 

TMPRSS2 

33.3%  (2/6) 

75.0%  (3/4) 

VEGFA 

7.7%  (1/13) 

57.1%  (4/7) 

VIM 

0.0%  (0/5) 

25.0%  (1/4) 

SD,  significantly  different,  based  on  a  KS  test,  P  <  0.01.  ND,  not  determined.  Tables  adapted  from  (10,  13). 
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Repositioning  of  MMP2,  MMP9  and  FLI1  were  confirmed  by  analysis  of  their  distributions  in 
individual  cancer  samples  to  their  PNDs.  Most  genes  in  the  test  set  displayed  limited  repositioning 
in  prostate  cancer  tissues  when  compared  to  their  PNDs.  Two-thirds  (26)  of  genes  were 
repositioned  in  0-33.3%  of  cancer  tissues,  compared  to  the  PND  and  six  genes  repositioned  in  40- 
50%  of  cancers  (Table  4b,  Figs  3,4)  (13).  As  observed  in  the  cross-comparison  analysis,  FLI1, 
MMP9  and  MMP2  showed  robust  repositioning  in  prostate  cancer,  and  their  distributions  were 
distinct  from  their  PNDs  in  92.9%  (13/14  cancers),  68.4%  (13/19)  and  56.3%  (9/16)  of  cancers, 
respectively  (Table  4b,  Figs  3,4)  (13).  Although  a  further  four  genes  were  also  repositioned  in 
more  than  50%  of  cancers  compared  to  the  PND  ( VEGF :  57.1%  (4/7),  MMP14\  66.7%  (2/3); 
NPM1:  66.7%  (2/3);  TMPRSS2:  75%  (3/4))  (Table  4b,  Figs  3,4)  (13),  they  are  not  robustly 
repositioned  in  prostate  cancer  given  that  visual  inspection  of  the  cumulative  frequency 
distribution  plots  reveals  that  no  more  than  a  few  distributions  in  cancer  tissues  fall  outside  the 
range  of  all  nonnal  tissues  (Table  4b  (13).  In  the  vast  majority  (96/125;  76.8%)  of  cases  there  was 
a  similar  repositioning  behavior  when  a  cancer  was  compared  to  its  matched  NAT  or  to  the  PND 
(Figs  3,4)  (13).  This  observation  suggests  that  repositioning,  if  present  in  a  patient,  is  limited  to 
the  cancer  tissue  itself  and  that  comparison  to  the  PND  is  suitable  to  assess  if  a  gene  is  repositioned 
in  cancer,  even  when  no  normal  tissues  from  the  same  individual  is  available.  We  conclude  MMP2, 
MMP9  and  FLI1  undergo  positional  changes  in  prostate  cancer  (13). 

We  were  unable  to  define  any  feature  of  a  prostate  cancer  that  predicted  the  likelihood  of  a  gene 
repositioning  within  it.  For  example,  the  genes  that  reposition  map  to  different  chromosomes 
(FLIP.  HSA11,  MMP9:  HSA20  and  MMP2:  HSA16).  The  repositioning  of  FLI1,  at  least,  is 
unlikely  to  reflect  whole  chromosome  movements  since  the  other  four  genes  analyzed  that  located 
on  HSA11  (CCND1,  FUT4,  MMP1  and  NUMA1)  did  not  repositioned  in  prostate  cancer. 
Biological  function  or  known  relevance  to  tumorigenesis  of  genes  also  did  not  predict 
repositioning  behavior.  Many  of  the  genes  we  positioned  that  are  commonly  implicated  in  prostate 
cancer  (e.g.  KLK3  (PSA),  AR,  PTEN,  TGFB1,  VEGFA  and  BCL2),  and  the  commonly  translocated 
genes  in  prostate  cancer  (TMPRSS2,  ERG  and  ETV1),  do  not  spatially  reposition  in  prostate 
cancers.  Although  an  increase  in  MMP9  and  MMP2  expression  has  been  linked  to  poor  prognosis 
in  prostate  cancer,  both  genes  are  predominantly  expressed  by  the  stromal  cells  and  not  by  the 
tumor  cells  (17),  which  we  analyzed.  Other  MMP  family  members  or  regulators  (e.g.  MMP14 , 
MMP1  TIMP2,  TIMP3  and  SPDEF  do  not  reposition  in  prostate  cancer,  further  reducing  a  link 
between  function  and  cancer-associated  gene  repositioning.  A  lack  of  a  link  between  gene  function 
and  an  altered  radial  position  in  cancer  is  in  line  with  multiple  studies  that  find  no  correlation 
between  changes  in  gene  expression  and  changes  radial  positioning  patterns,  including  a  cell 
culture  model  of  early  breast  cancer  (12,  18-20).  Moreover,  only  two  of  the  eleven  marker  genes 
that  robustly  reposition  in  either  breast  or  prostate  cancer  (18.2%;  MMP9  and  FLIP,  Table  5,  Fig. 
5)  reposition  in  both  breast  and  prostate  cancer  (10,  11,  13),  demonstrating  that  cancer-related 
repositioning  is  not  a  gene-intrinsic  feature,  or  a  more  general  response  to  disease  in  general,  but 
that  distinct  sets  of  genes  reposition  in  cancer  in  a  tissue-of-origin  specific  manner. 


20 


Table  5.  A  comparison  of  the  number  of  breast  and  prostate  cancers  with  gene  repositioning 


events 


Gene 

%  (and  number)  of  SD  cross-comparison  between  individual  cancer 

tissues  and  PND 

Breast  cancer 

Prostate  cancer 

AKT1 

64.3%  (9/14) 

50.0%  (3/6) 

CSF1R 

69.2%  (9/13) 

44.4%  (4/9) 

ERBB2 

71.4%  (10/14) 

40.0%  (2/5) 

FLU 

100.0%  (10/10) 

92.9%  (13/14) 

FOSL2 

69.2%  (9/13) 

42.9%  (3/7) 

HES5 

100.0%  (13/13) 

33.3%  (2/6) 

HSP90AA1 

81.8%  (9/11) 

50.0%  (2/4) 

MMP2 

0.0%  (0/6) 

56.3%  (9/16) 

MMP9 

83.3%  (10/12) 

68.4%  (13/19) 

MYC 

72.7%  (8/11) 

0.0%  (0/5) 

TGFB3 

78.6%  (11/14) 

25.0%  (1/4) 

BCL2 

16.7%  (1/6) 

0.0%  (0/4) 

CCND1 

33.3%  (2/6) 

25.0%  (1/4) 

MMP1 

33.3%  (1/3) 

0.0%  (0/3) 

VEGF 

50.0%  (3/6) 

57.1%  (4/7) 

SD,  significantly  different,  based  on  a  two-sample  ID  KS  test,  P  <  0.01.  PND,  pooled  normal  distribution.  Red,  the 
gene  has  been  classified  as  being  repositioned  in  breast  and/or  prostate  cancer.  Data  from  (10,  11,  13),  table  from  (10). 
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Figure  5:  Certain  loci  adopt  alternative  nuclear  positions  in  disease  (tick)  compared  to  normal  cells,  whilst  the 
positions  of  other  loci  are  conserved  in  disease  (cross).  The  repositioning  of  some  loci  is  disease  specific,  and  although 
a  gene  repositions  in  one  disease,  it  may  not  in  another  disease.  For  example,  FLII  repositions  in  breast  and  prostate 
cancer,  whereas  MMP2  repositions  in  prostate  disease  but  not  breast  cancer.  Blue  =  Nucleus.  (Fig  adapted  from 
Meaburn,  K.J.  The  Emerging  Role  of  Spatial  Genome  Organization  as  a  Diagnosis  Tool.  Frontiers  in  genetics.  Under 
review. 


21 


Moreover,  the  propensity  of  the  three  marker  genes  to  reposition  in  a  given  cancer  was  not  linked 
to  changes  in  copy  number  (Table  6)  (10,  13).  Consequently,  our  data  demonstrates  that  even  in 
the  background  of  genetic  instability  it  is  still  possible  to  use  gene  positioning  for  cancer 
diagnostics. 


Table  6.  Repositioning  events  are  not  due  to 


Cancer 

FLU 

MMP9 

MMP2 

Cl 

-  58.6% 

C2 

-  64.8% 

C3 

-  49.2% 

-  49.0% 

-  58.3% 

C4 

-  66.4% 

-  50.0% 

-  87.2% 

C5 

-  77.8% 

-  58.1% 

-  52.7% 

C6 

-  87.5% 

C7 

+  40.8% 

++  69.6% 

+  40.3% 

C8 

C9 

CIO 

-  55.4% 

Cll 

C12 

++  57.3% 

+  45.1% 

+  31.6% 

C13 

+  31.9% 

C14 

-  47.4% 

-  50.9% 

C15 

-  56.4% 

-  60.6% 

C16 

-  50.0% 

-  64.4% 

C17 

-  46.0% 

-  61.5% 

C18 

+  48.4% 

C19 

++  60.6% 

C20 

-  40.3% 

C21 

-  44.9% 

C22 

-  46.0% 

C23 

-  62.6% 

C24 

-  74.1% 

variations  in  copy  number 


Non-cancer 

FLU 

MMP9  MMP2 

N 1 

N2 

N3 

N4 

N5 

-  40.8% 

N7 

-  55.8% 

N8 

N12 

-  54.0% 

N13 

-  64.5% 

N14 

-  63.6% 

N15 

-  52.4% 

-  55.2%  -  48.5% 

-  59.2% 

■  47.3% 


■  66.3% 

■  61.4% 

■  52.5% 

■  64.2% 


■  55.1% 


-  58.6%  -  48.0% 


A  comparison  of  the  incidence  of  gene  repositioning  with  the  number  of  FISH  signals  detected  for  a  gene  in  a  given 
tissue.  Black  box:  the  gene  was  reposition  compared  to  the  pooled  normal  distribution  (P  <  0.01;  KS  test);  grey  box: 
no  repositioning  compared  to  the  pooled  normal  (P  >  0.01);  white  box:  gene  positioning  not  determined.  The  copy 
number  of  a  gene  was  diploid  unless  otherwise  stated.  Copy  number  changes  are  denoted  by:  +,  20-49.9%  of  nuclei 
had  >3  gene  FISH  signals;  ++,  >  50%  of  nuclei  had  >3  gene  signal;  >40%  of  nuclei  had  a  single  FISH  signal.  The 
percentage  of  nuclei  with  the  corresponding  number  of  FISH  signals  (+/++/-)  is  also  displayed.  PND,  Pooled  normal 
distribution.  Nl-8,  Normal  tissue,  from  cancer  free  prostates;  N12-N25  =  NAT;  Bl-5,  hyperplasia.  Tissues  are  color- 
coded  to  indicate  tissues  from  the  same  individual  (e.g.  N12  is  the  NAT  from  cancer  Cl).  With  the  exception  ofC3/Bl, 
analysis  for  these  pairs  of  normal  and  cancer  tissues  was  performed  on  the  same  slide  (same  4-5  pm  tissue  section), 
for  a  given  gene.  Table  from  (13). 
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Task  3:  High-throughput  software 

A  major  bottleneck  in  the  analysis  of  these  samples  is  the  image  analysis.  We  have  relied  on  a 
semi-manual  method  to  identify  nuclei  and  FISH  signals  in  the  tissue  sample,  the  large  number  of 
individual  nuclei  (100  per  tissue  sample,  several  hundred  tissues)  exceeded  our  analysis  capacity. 
In  this  semi-automated  analysis  nuclei  must  be  manually  identified  and  segmented.  To  try  to 
overcome  this  problem,  we  have,  in  close  collaboration  with  Dr.  Stephen  Lockett,  NCI,  been 
developing  a  novel  image  analysis  tool.  However,  so  far  we  have  been  unable  to  develop  a  tool 
that  could  segment  nuclei  with  a  high  enough  precision  for  our  use.  Since  the  automated  software 
was  unable  perfonn  at  a  high  enough  level  we  were  unable  to  perform  high-throughput  automated 
image  analysis  and  had  to  only  use  manual  analysis.  This  significantly  slowed  down  our 
throughput,  and  we  have  been  unable  to  validate  the  genes  in  a  large  number  of  specimens  (Task 
4-10). 

Task  4:  Validation  in  benisn  tissue 

Cancer  specificity  of  FLI1  and  MMP9  repositioning 

Given  that  loci  can  alter  their  nuclear  location  in  diseases  other  than  cancer  (11,  21,  22),  we  sought 
to  determine  if  the  repositioning  of  FLI1,  MMP9  and  MMP2  was  specific  to  cancer  or  was  a  more 
general  feature  of  prostate  disease.  We  compared  the  positioning  patterns  of  these  genes  in  five 
benign  hyperplastic  tissues  with  the  positioning  patterns  of  normal  prostate  tissues,  and  between 
multiple  hyperplasia  tissues  (Table  7  and  Figure  6)  (13).  MMP9  and  MMP2  were  positioned 
identically  in  all  hyperplastic  tissues  (Table  7  and  Figure  6)  (13).  The  position  of  FLI1  is  similarly 
conserved  between  hyperplastic  tissues,  with  only  a  single  cross-comparison  reaching  significance 
(1/10;  Table  7  and  Figure  6).  FLI1  and  MMP9  did  not  reposition  in  hyperplasia  compared  with 
nonnal  tissues,  since  the  positioning  of  both  genes  was  indistinguishable  from  the  PND  in  all  five 
hyperplasia  tissues  (Table  7  and  Figure  6)  and  only  10%  (4/40)  and  12%  (6/50)  of  the  cross¬ 
comparisons  between  individual  normal  tissues  reached  significance,  respectively.  These  findings, 
combined  with  the  low  repositioning  rate  in  normal  tissue,  suggests  that  FLI1  and  MMP9 
repositioning  is  specific  to  cancer.  In  contrast,  compared  to  nonnal  tissues  MMP2  was  repositioned 
in  both  prostate  cancer  and  in  hyperplasia.  MMP2  was  in  a  significantly  different  position  in  80% 
(4/5)  of  the  hyperplastic  tissues  compared  to  the  PND  and  in  45.0%  (18/40)  of  cross-comparisons 
between  hyperplasic  and  normal  tissues  (Table  6  and  Figure  7)  (13),  suggesting  that  the 
repositioning  of  MMP2  is  common  in  prostate  disease,  and  not  limited  to  malignancy.  We  had 
access  to  both  hyperplastic  and  cancer  tissue  from  the  same  prostate  (B1  and  C3,  respectively; 
Table  2,  (13));  in  keeping  with  the  comparisons  between  hyperplasia  and  nonnal  tissue,  FLI1  and 
MMP9  occupied  significantly  different  positions  in  the  cancer  and  hyperplasic  tissue  (I  =  0.004 
and  0.002,  respectively),  whereas  MMP2  was  in  a  similar  position  in  the  two  disease  states  (P  = 
0.516),  for  this  individual.  We  conclude  that  repositioning  of  MMP9  and  FLI1  is  cancer  specific, 
whereas  MMP2  repositions  in  both  benign  disease  and  cancer  (Fig.  5)  (13). 
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Figure  6 
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Fig.  6.  Gene  positioning  in  benign  disease.  Positions  of  indicated  genes  were  compared  between  hyperplasia  (B1-B5) 
and  normal  prostate  tissue.  (A)  Cumulative  RRDs  for  the  indicated  genes  in  hyperplasia  (red),  normal  tissues  (black) 
and  the  pooled  normal  distribution  (PND;  blue).  (B)  Pairwise  statistical  comparisons  of  RRDs  between  hyperplasic 
tissues  and  normal  tissues  and  amongst  hyperplasic  tissues,  using  the  two-sample  ID  KS  test.  All  three  genes  are 
similarly  positioned  between  hyperplasic  tissues  and  only  MMP2  repositions  in  hyperplasic  compared  to  normal 
tissues.  RRP,  relative  radial  position.  Figure  adapted  from  (10,  13). 
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Normal  and  NAT  prostate  tissue  often  contain  small  areas  of  mild  hyperplasic  morphology. 
Therefore,  as  a  control  to  ensure  the  heterogeneity  in  positioning  patterns  for  some  genes  in  normal 
prostate  tissues  was  not  a  reflection  of  the  inclusion  of  abnormal  tissue,  we  positioned  one  of  the 
genes  with  a  high  level  of  variability  between  individuals,  CSF1R ,  in  hyperplasic  prostate  tissues. 
CSF1R  was  positioned  identically  amongst  all  hyperplasic  prostate  tissues  (Table  7,  Fig.  6)  (10). 
Similarly,  the  position  of  CSF1R  was  also  similar  between  hyperplasic  and  normal  tissues  with  all 
four  hyperplasic  prostate  tissues  distributions  statistically  similar  to  the  PND  and  only  18.2% 
(8/44)  of  cross-comparisons  between  hyperplasic  and  individual  normal  tissues  being  significantly 
different  (Table  7,  Fig.  6)  (10).  This  suggests  that  the  presence  of  hyperplasic  nuclei  within  a 
nonnal  tissue  RRD  is  not  skewing  spatial  positioning  patterns. 


Table  7.  Comparison  of  gene  positioning  in  benign  tissues 


%  (and  number)  of  SD  cross-comparison  between: 

Gene 

Individual  hyperplasia 

tissues 

Individual  normal  and 
hyperplasia  tissues 

Individual  hyperplasia 
tissues  and  pooled 
normal 

flii 

10.0%  (1/10) 

10.0%  (4/40) 

0.0%  (0/5) 

MMP9 

0.0%  (0/10) 

12.0%  (6/50) 

0.0%  (0/5) 

MMP2 

0.0%  (0/10) 

45.0%  (18/40) 

80.0%  (4/5) 

CSF1R 

0.0%  (0/6) 

18.2%  (8/44) 

0.0%  (0/4) 

SD,  significantly  different,  based  on  a  two-sample  ID  KS  test,  P  <  0.01.  Table  adapted  from  (10,  13) 


Task  1,  4,  5 

High-confidence  detection  of  prostate  cancers  using  multiplexed  positioning  biomarkers 

Since  FLI1  and  MMP9  repositioning  occurs  predominantly  in  cancer  tissues,  we  determined  their 
false  positive  and  false  negative  rates  as  a  preliminary  step  towards  assessing  their  potential  as 
biomarkers  for  prostate  cancer.  We  defined  a  false  positive  reading  as  repositioning  of  the  marker 
gene  in  a  non-malignant  (nonnal  or  hyperplastic)  tissue  compared  to  the  PND,  thus  incorrectly 
classifying  the  tissue  as  cancer.  Conversely,  we  defined  a  false  negative  reading  as  a  lack  of 
repositioning  of  the  marker  gene  in  a  cancer,  leading  to  incorrect  classification  of  the  tissue  as  non- 
malignant.  As  expected  due  to  its  high  rate  of  repositioning  in  hyperplasia,  MMP2  had  a  high  false 
positive  rate  (30.8%;  Table  8)  (13).  This  finding,  combined  with  its  high  false  negative  rate 
(43.8%;  Table  9)  (13)  suggests  that  MMP2  would  not  be  a  useful  biomarker  for  prostate  cancer.  In 
contrast,  FLI1  had  both  a  low  false  negative  and  false  positive  rate  (7.1%  and  7.7%  respectively; 
Tables  8  and  9)  (13).  MMP9  had  a  fairly  high  false  negative  rate  (31.6%;  Table  9)  (13),  but  a  low 
false  positive  rate  (6.7%;  Table  8)  (13). 


Table  8.  False  positive  rates 


Gene 

Normal  tissue 
false  positives 

Benign  tissue 
false  positives 

Total  false 
positives 

FLII 

12.5%  (1/8) 

0.0%  (0/5) 

7.7%  (1/13) 

MMP9 

10.0%  (1/10) 

0.0%  (0/5) 

6.7%  (1/15) 

MMP2 

0.0%  (0/8) 

80.0%  (4/5) 

30.8%  (4/13) 

The  percentage  (and  number)  of  tissues  that  give  a  false  positive  result.  A  false  positive  is  scored  when  a  gene  has  a 
statistically  significant  different  RRD  in  a  normal  or  hyperplastic  tissue  compared  to  the  pooled  normal  (P  <  0.01;  KS 
test).  Table  from  (13). 
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Table  9  False  negative  rates  for  single  and  multiplexed  genes 

Gene(s)  False  negative  rate 

Single  gene _ 

flii 

MMP9 

MMP2 _ 

Multiplexed  genes 

FLII  and  MMP9  0.0%  (0/11) 

FLII  and  MMP2  9.1%  (1/1 1) 

MMP9  and  MMP2 _ 28.6%  (4/14) _ 

The  percentage  (and  number)  of  tissues  that  give  a  false  negative  result.  A  false  negative  is  scored  when  the  RRD  of 
a  gene  in  a  cancer  tissue  is  similar  to  that  of  the  pooled  normal  distribution  (KS  test,  P>  0.01).  For  multiplexed  genes, 
a  false  positive  is  scored  when  neither  gene  in  the  pair  is  repositioned  compared  to  the  pooled  normal  distribution  (P 
>  0.01,  KS  test).  Only  tissues  where  both  genes  of  the  pair  were  positioned  have  been  included.  Table  from  (13). 


7.1%  (1/14) 
3 1.6%  (6/19) 
43.8%  (7/16) 


None  of  the  genes  we  tested  repositioned  in  all  prostate  cancer  tissues.  However,  at  least  one 
repositioned  gene  was  present  in  every  prostate  cancer  tissue  analyzed  (30/30  cancers;  Fig.  4)  (13). 
Given  the  low  false  positive  rate  of  FLII  and  MMP9,  we  explored  the  possibility  of  using  these 
two  genes  in  a  multiplex  format  for  the  detection  of  cancer  samples.  When  analyzed  together,  FLII 
or  MMP9  (or  both)  repositioned  in  all  (11/11)  cancers  (Table  10,  Fig.  4)  (13).  Consequently, 
multiplexing  FLL1  and  MMP9  reduced  the  false  negative  rate  to  0%  (0/11;  Table  9)  (13).  We 
conclude  that  in  combination  FLII  and  MMP9  are  strong  potential  biomarkers  for  prostate  cancer 
(13). 


Table  10.  Gene  Multiplexing 


Gene 

MMP9 

MMP2 

FLII 

100.0%  (11/11) 

90.9%  (10/11) 

MMP9 

71.4%  (10/14) 

The  percentage  (and  number)  of  cancers  where  at  least  one  of  the  indicated  pair  of  genes  repositioned,  compared  to 
the  pooled  normal  distribution  (P  <  0.01,  KS  test).  Only  tissues  where  both  genes  of  the  pair  have  been  positioned  are 
included.  Table  from  (13). 


Determination  of  prognostic  potential  for  spatial  genome  organization  biomarkers 

Since  none  of  our  prostate  cancer  GPBs.  FLII,  MMP9  and  MMP9,  repositioned  in  all  cancers  we 
further  characterized  their  spatial  repositioning  patterns  to  detennine  if  they  were  candidate 
biotnarkers  for  prognostic  applications. 

Task  6:  Marker  of  tumor  grade 

We  first  asked  if  gene  repositioning  could  distinguish  between  Gleason  grades.  FLII  and  MMP2 
cannot  distinguish  between  Gleason  grade,  and  were  repositioned,  when  compared  to  the  PND,  at 
a  similar  frequency  between  Gleason  grades  (Table  2,  1 1,  Figure  2-4,  7)  (13).  FLII  repositioned 
in  5  out  of  5  Gleason  grade  5  and  6  cancers,  4/5  Gleason  grade  cancers  and  3/3  Gleason  grade  8 
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and  9  cancers  (Table  1 1)  (13).  Similarly,  MMP2  repositioned  in  4/7  Gleason  grade  5  and  6  cancers, 
2/4  Gleason  grade  cancers  and  1/2  Gleason  grade  8  and  9  cancers  (Table  11)  (13).  Conversely, 
MMP9  showed  an  increase  incidence  of  repositioning  in  Gleason  grade  7  cancers  (8/9;  88.9%) 
compared  to  low  Gleason  score  cancers  ( 1/5;  20%)  (Table  1 1)  (13).  We  also  selected  an  additional 
gene  for  further  analysis.  Form  our  initial  analysis  SP100  repositioned  in  one  Gleason  score  6 
cancer  but  not  in  three  Gleason  score  7  cancers  (13).  We  increased  the  number  of  normal  and 
cancer  tissues  analyzed  (Fig.  8  and  9).  As  further  evidence  that  the  PND’s  sample  size  appropriate, 
the  addition  of  two  normal  tissues  (239  nuclei)  gave  a  statically  identical  PND  to  our  original  PND 
(P  0.83,  Fig.  8;  unpublished  data).  As  opposed  to  MMP9,  SP100  repositioned  in  low  score  (5-6) 
but  not  intermediate  score  (7)  prostate  cancers  (Fig.  9,  unpublished  data)  SP100  repositioned  in 
66.7%  (4/6)  Gleason  score  <6  cancers,  but  does  not  reposition  in  Gleason  score  7  cancers  (0/3; 
Fig.  9,  Unpublished  data).  Future  studies  are  required,  to  increase  the  sample  size,  and  to  increase 
the  range  of  Gleason  scores  compared. 


Figure  7 


Periphery  RRP  Center 


Fig.  7.  Gene  repositioning  in  prostate  cancer  is  not  linked  to  prognostic  status.  Cumulative  RRDs  are  color-coded 
according  to  the  prognostic  status  of  the  cancer:  Blue,  indolent  prostate  cancers  (Gleason  score  <6,  without  metastasis); 
green,  intermediate  cancers  (Gleason  score  7,  without  metastasis)  and  red,  aggressive  cancers  (Gleason  score  >8, 
and/or  metastasis).  The  pooled  normal  distribution  (black)  is  included  for  comparison.  RRP,  relative  radial  position. 
Figure  adapted  from  (13). 

Figure  8 


RRDT 

Periphery  Center 

Fig.  8.  Cumulative  RRDs  for  the  of  SP100  when  5  (black  (13))  or  7  (red  (unpublished  data))  normal  tissues  are  used 
to  generate  the  pooled  normal  distribution  (PND).  Statistical  comparison  by  two-sample  ID  KS  test.  RRP,  relative 
radial  position. 
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Table  11.  Characterization  of  repositioning  events  by  Gleason  grade  and  other  markers  of 
risk. 


Gene 

Total 

Gleason 
score  (GS)  7 

GS  5  or 

6 

GS  8  or 

9 

T3 

“aggressive”  / 
’’high  risk” 

Unknown 

AR 

6(2) 

4(2) 

2 

BCL2 

4(0) 

2 

1 

1 

2 

BRCA2 

3(1) 

1(1) 

2 

CCND1 

4(1) 

2(1) 

1 

1 

2 

DCN 

3(0) 

2 

1 

1 

1 

EGFR 

4(0) 

2 

2 

ERG 

3(1) 

2(1) 

1 

ESR2 

4(0) 

3 

2 

2 

1 

ETV1 

3(0) 

3 

FGFR1 

5(2) 

4(1) 

2  (1) 

2(1) 

1(D 

FGFR2 

4(2) 

3(2) 

1 

FLU 

14(13) 

5(4) 

5(5) 

3(3) 

3(2) 

3(2) 

5(4) 

1 

FOXA1 

4(0) 

3(2) 

1 

1 

1 

FUT4 

3(0) 

2 

1 

GREB1 

3(0) 

1 

2 

HOXA9 

3(1) 

3(1) 

1 

1 

KLK3  (PSA) 

4(1) 

2(1) 

1 

1 

1 

1 

LMNA 

3(1) 

3(1) 

1(1) 

1(1) 

MATR3 

3(0) 

2 

1 

MMP1 

3(0) 

2 

1 

MMP14 

3(2) 

1 

2(2) 

MMP2 

16(9) 

4(2) 

7(4) 

2(1) 

3(1) 

4(2) 

5(3) 

3(2) 

MMP9 

19(13) 

9(8) 

5(1) 

2(1) 

3(2) 

5(4) 

6(5) 

3(3) 

NPM1 

3(2) 

2(2) 

1 

1(1) 

1(1) 

NUMA1 

4(2) 

3(1) 

1(1) 

2(1) 

2(1) 

PADI4/6 

6(1) 

5(1) 

1(1) 

1(1) 

1 

PTEN 

3(0) 

2 

1 

RAF1 

5(2) 

3(2) 

2(2) 

2(2) 

3 

SATB1 

4(1) 

3(1) 

1 

1(1) 

1(1) 

SERPINB2 

4(0) 

2 

2 

SP100 

4(1) 

3 

1(D 

1 

1 

SPDEF 

4(0) 

2 

2 

TGFB1 

4(0) 

2 

2 

2 

2 

THBS1 

4(2) 

3(1) 

1(1) 

1 

2(1) 

TIMP2 

7(2) 

4(2) 

1 

1 

2 

3 

1 

TIMP3 

4(2) 

3(1) 

1(1) 

1(1) 

2(2) 

TMPRSS2 

4(3) 

3(2) 

1(1) 

1(D 

1(1) 

VEGFA 

7(4) 

6(3) 

2 

2(1) 

1(D 

VIM 

4(1) 

1 

1 

1 

3(1) 

AKT1 

CSF1R 

ERBB2 

FOSL2 

HES5 

HSP90AA1 

MYC 

TGFB3 

6(3) 

9(4) 

5(2) 

7(3) 

6(2) 

4(2) 

5(0) 

4(1) 

1(1) 

3(1) 

4(2) 

3 

1 

3 

3(1) 

2 

1(1) 

1 

2(1) 

5(3) 

5(3) 

5(2) 

1(1) 

3(1) 

1 

2 

The  number  of  tissue  analyzed  for  each  gene.  The  numbers  in  parentheses  indicate  the  number  of  cancers  where  the 
gene  repositioned,  compared  to  the  pooled  normal  distribution  (P  <  0.01,  KS  test).  Where  there  is  no  parentheses  the 


gene  did  not  repositioned  in  any  cancers  analyzed.  GS,  Gleason  score,  Met,  metastatic  cancer,  T3,  TNM  classification, 
T3  is  a  marker  of  a  cancer  at  high  risk.  Table  using  data  from  (10,  13). 
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Figure  9 
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Fig.  9.  The  spatial  position  of  SP100  is  a  marker  of  Gleason  score,  but  not  aggressive  prostate  cancer.  A)  Cumulative 
RRDs  are  color-coded  according  to  the  Gleason  score  (gs)  of  the  cancer:  Gleason  score  5  or  6  (blue),  Gleason  score  7 
(purple),  normal  tissues  (black)  and  the  pooled  normal  distribution  (PND;  yellow).  Pairwise  statistical  comparisons  of 
RRDs  between  (B)  cancer  tissues  and  the  PND,  or  (C)  individual  normal  and  cancer  tissues,  using  the  two-sample  ID 
KS  test.  RRP,  relative  radial  position;  gg,  Gleason  grade;  s,  Stage,  +,  20-49.9%  of  nuclei  had  >3  gene  FISH  signals; 
++,  >  50%  of  nuclei  had  >3  gene  signal;  -,  >40%  of  nuclei  had  a  single  FISH  signal.  Unpublished  Data. 
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Task  7.  Detection  of  early  cancer 

Prostate  cancers  with  Gleason  scores  of  <6  are  classified  as  low  grade,  early  cancers.  Therefore, 
given  the  results  described  above  SP100  and  MMP9  are  candidate  markers  to  distinguish  early 
prostate  cancer,  since  SP100  repositions  in  <6  prostate  cancers,  but  not  in  Gleason  score  7  cancers 
and,  conversely,  MMP9  predominantly  repositions  in  Gleason  score  >7  cancers  (Table  11,  Fig  2- 
4,  9)  (73)  (unpublished  data).  Future  studies  will  be  required  to  validate  this  finding  in  larger 
numbers  of  tissues,  particularly  in  cancers  with  a  large  range  of  Gleason  scores.  Most  likely,  the 
combined  use  of  these  2  genes  will  be  a  stronger  biomarker  than  either  used  singularly. 

Task  9:  metastatic  marker  potential 

The  variation  in  the  positioning  patterns  of  FLI1,  MMP9,  MMP2  and  SP100  did  not  correlate  with 
the  clinicopathologically  defined  aggressiveness  of  the  cancer  (Table  11,  Fig.  7  and  9)  (73) 
(Unpublished  data).  Neither  the  propensity  to  reposition  nor  the  direction/degree  of  repositioning 
could  be  used  to  distinguish  metastatic  cancers  from  non-metastatic  cancers  using  the  radial 
positioning  patterns  of  FLI1,  MMP9,  MMP2  or  SP100  (Table  11,  Fig.  7  and  9)  (73)  (Unpublished 
data). 

Task  2,  8,10:  Nothing  to  report. 

No  cost  extension  tasks: 

Task  IF  Grant  reports  finalized. 

Task  12:  The  2  manuscripts  based  on  aims  1  and  2  were  successfully  finalized,  and  have  now  been 
published  in  peer-review  journals ,  Molecular  Biology  of  the  Cell  and  Ftistochem  Cell  Biol  (10,  13) 
(Appendix).  Additional  data  generated  for  this  task  is  included  in  the  above  discussion  of  the 
resulsts,  since  this  data  was  the  analysis  of  the  positioning  of  our  candidate  genes  additional  tissues. 
Task  13 :  SP 100  data  as  presented  in  Task  6  and  9. 


Summary 

We  have  identified  two  genes,  MMP9  and  FLI1,  which  undergo  cancer-specific  repositioning, 
which  can  be  used  to  distinguish  cancerous  prostate  tissue  from  normal/hyperplasic  tissue  with 
high  accuracy.  We  conclude  that  the  spatial  positioning  patterns  of  MMP9  and  FLI1,  especially 
when  used  in  combination,  are  strong  candidate  biomarkers  for  use  in  cancer  diagnosis  (73).  Future 
large-scale  studies  will  be  required  to  further  validate  their  diagnostic  potential. 
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4.  Key  Research  Accomplishments 

•  Interphase  spatial  positioning  patterns  of  48  genes  have  been  screened  in  a  panel  of  normal 
and  cancerous  human  prostate  tissues  to  identify  candidate  marker  genes  for  prostate 
cancer  detection. 

•  Demonstration  of  minimal  variation  in  the  spatial  position  of  a  most  gene  amongst  normal 
individuals.  Thus,  any  repositioning  between  normal  and  cancer  tissues,  for  most  genes, 
are  specific  to  disease  and  are  not  a  consequence  of  inter-individual  differences  in 
positioning  patterns. 

•  Demonstration  of  gene-specific  repositioning  events  associated  with  prostate 
carcinogenesis. 

•  Demonstration  of  an  absence  of  global  genome  reorganization  in  prostate  cancer  cells. 

•  Identification  of  three  potential  prostate  cancer  maker  genes  {FLI1,  MMP9,  MMP2 ). 

•  Demonstration  that  the  repositioning  events  in  cancer  are  not  a  consequence  of  genomic 
instability. 

•  Establishment  of  a  pooled  normal  distribution  for  use  as  a  standard  for  comparison  with 
unknown  samples. 

•  Determination  of  false  positive/negative  rates. 

•  Demonstration  that  multiplexing  improves  sensitivity. 

•  Determination  that  different  sets  of  genes  reposition  in  breast  and  prostate  cancer. 

•  Validation  that  FLI1  and  MMP9  gene  repositioning  events  are  specific  to  cancer,  whereas 
MMP2  repositioning  events  are  specific  to  disease. 

•  Proof-of-principle  that  spatial  gene  positioning  has  prognostic  potential  since  SP100  and 
MMP9  differentially  repositions  in  low  grade  and  intermediate  grade  prostate  cancers. 


5.  Conclusions 

We  have  developed  a  strategy  to  identify  a  novel  class  of  prostate  cancer  biomarkers,  based  on  the 
differential  spatial  localization  of  genes  within  the  cell  nucleus.  Application  of  this  strategy  has 
led  to  the  identification  and  partial  characterization  of  three  promising  novel  prostate  cancer 
biomarkers  {FLI1,  MMP9,  MMP2),  which  warrant  further  investigation.  We  tested  their  usefulness 
in  human  prostate  formalin- fixed  paraffin  embedded  tissues.  We  adapted  the  approach  to  the 
requirements  in  a  clinical  setting  by  developing  a  nonnalized  standard  reference  distribution 
(PND)  for  each  gene.  We  found  that  FLI1,  MMP9  and  MMP2  repositioned  in  a  significant  fraction 
of  prostate  cancers,  compared  to  the  normal  standard.  The  sensitivity  of  detecting  prostate  cancer 
was  markedly  improved  by  multiplexing  markers.  The  repositioning  of  FLI1  and  MMP9  was 
specific  to  cancer,  and  had  low  false  positive  rates.  MMP2  spatial  positioning  is  a  marker  of 
prostatic  disease,  but  not  specifically  cancer,  since  it  also  repositioned  in  BPH.  Spatial  gene 
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repositioning  in  prostate  cancer  is  highly  gene  specific,  and  not  a  reflection  of  a  global 
reorganization  of  the  genome.  These  repositioning  events  were  also  not  due  to  inter-individual 
variations  in  gene  positioning,  since  the  positioning  pattern  of  most  genes  was  highly  similar 
between  normal  tissues  from  different  individuals.  Nor  were  the  repositioning  events  correlated 
with  genomic  instability.  We  also  provide,  for  the  first  time,  a  proof-of-principle  that  spatial 
positioning  of  the  genome  has  can  be  used  to  stratify  prostate  cancers,  and  therefore  are  potential 
prognostic  markers,  since  SP100  only  repositioned  in  Gleason  score  5  and  6  cancers,  whereas 
MMP9  repositioned  more  frequently  in  Gleason  score  7  cancers.  Further  large-scale  studies,  using 
a  wide  range  of  cancers,  are  required  to  validate  our  findings  and  to  identify  prognostic  makers. 

In  the  long-term,  our  efforts  should  lead  to  the  development  of  a  robust,  standardized  method  for 
the  detection  of  prostate  cancer  in  a  routine  diagnostic  laboratory  setting.  While  conventional 
prostate  cancer  diagnosis  is  largely  based  on  qualitative  morphological  criteria,  our  assay  provides 
highly  quantitative  method  for  the  detection  of  cancerous  cells.  Analysis  of  gene  positioning 
patterns  promises  to  be  a  sensitive  and  effective  diagnostic  approach  for  prostate  cancer.  Although 
the  marker  genes  thus  far  identified  did  not,  spatial  positioning  patterns  also  have  the  promise  to 
stratify  aggressive  and  indolent  prostate  cancer  and  to  act  as  robust  prognostic  markers.  Consistent 
with  this  hypothesis,  we  find  differences  in  gene  positioning  patterns  between  individual  cancers, 
different  genes  reposition  in  prostate  cancers  to  those  that  reposition  in  breast  cancer  and,  most 
importantly,  the  positioning  patterns  of  MMP9  and  SP100  can  be  used  to  sub-type  Gleason  score 
5  and  6  cancer  from  Gleason  score  7  cancers.  Our  approach  overcomes  several  of  the  limitations 
of  current  and  currently  proposed  diagnostic  tests  since  it  is  i)  highly  quantitative,  ii)  based  on 
single  cell  analysis,  iii)  applicable  to  extremely  small  tissue  samples,  thus  reducing  the  requirement 
for  additional  exploratory  invasive  procedures,  iv)  is  independent  from  the  generation  of 
metaphase  chromosome,  which  can  be  difficult  to  obtain  from  solid  tumors  and  v)  is  insensitive  to 
protein  and  RNA  degradation,  which  commonly  occurs  during  biopsy  sample  handling,  unlike 
immunohistochemistry-,  PCR-,  or  microarray-based  diagnostic  approaches.  Moreover,  our  method 
of  diagnostics  can  easily  be  integrated  into  clinical  laboratories  as  an  extension  to  existing  routine 
cytogenetic  procedures  using  FISH  to  detect  gene  amplifications  in  solid  tumors.  Our  assay  will 
extend  and  complement  conventional  morphology-based  diagnostics  and  it  is  anticipated  that  the 
combined  use  of  standard  pathological  indicators  and  our  method  will  be  a  highly  accurate, 
quantitative  and  powerful  diagnostic  approach. 
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ABSTRACT  Genes  occupy  preferred  spatial  positions  within  interphase  cell  nuclei.  However, 
positioning  patterns  are  not  an  innate  feature  of  a  locus,  and  genes  can  alter  their  localization 
in  response  to  physiological  and  pathological  changes.  Here  we  screen  the  radial  positioning 
patterns  of  40  genes  in  normal,  hyperplasic,  and  malignant  human  prostate  tissues.  We  find 
that  the  overall  spatial  organization  of  the  genome  in  prostate  tissue  is  largely  conserved 
among  individuals.  We  identify  three  genes  whose  nuclear  positions  are  robustly  altered  in 
neoplastic  prostate  tissues.  FLI1  and  MMP9  position  differently  in  prostate  cancer  than  in 
normal  tissue  and  prostate  hyperplasia,  whereas  MMP2  is  repositioned  in  both  prostate  can¬ 
cer  and  hyperplasia.  Our  data  point  to  locus-specific  reorganization  of  the  genome  during 
prostate  disease. 
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INTRODUCTION 

The  interphase  nucleus  is  a  highly  organized  organelle,  with  genes 
and  chromosomes  occupying  preferred  locations  within  the  cell 
nucleus  (Ferrai  et  a/.,  201 0;  Meaburn  et  a/.,  201 6).  Although  the  spa¬ 
tial  organization  of  the  genome  is  largely  conserved  among  indi¬ 
viduals  (Borden  and  Manuelidis,  1988;  Wiech  et  a/.,  2005;  Murata 
et  a/.,  2007;  Meaburn  et  a/.,  2009;  Timme  et  a/.,  201 1),  nuclear  loca¬ 
tion  is  not  an  inherent  feature  of  a  locus,  since  gene  loci  can  occupy 
distinct  positions  depending  on  context  or  conditions.  For  example, 
gene  positions  may  differ  among  tissue  types  during  differentiation 
or  as  cells  become  quiescent  or  senescent  (Bridger  et  a/.,  2000; 
Boyle  eta/.,  2001;  Kosak  eta/.,  2002;  Parada  eta/.,  2004;  Meaburn 
and  Misteli,  2008;  Meaburn  et  a/.,  2016;  Takizawa  et  a/.,  2008a; 
Ferrai  et  a/.,  2010;  Peric-Hupkes  et  a/.,  2010;  Foster  et  a/.,  2012). 


This  article  was  published  online  ahead  of  print  in  MBoC  in  Press  (http://www 
.molbiolcell.org/cgi/doi/10.1091/mbc.E15-05-0280)  on  November  12,  2015. 
Address  correspondence  to:  Tom  Misteli  (mistelit@mail.nih.gov),  Karen  J. 
Meaburn  (meaburnk@mail.nih.gov). 

Abbreviations  used:  EDT,  Euclidean  distance  transform;  FFPE,  Formalin-fixed, 
paraffin-embedded;  FISH,  fluorescence  in  situ  hybridization;  HSA,  human  chro¬ 
mosome;  KS,  Kolmogorov-Smirnov;  NAT,  normal  adjacent  to  tumor;  PND, 
pooled  normal  distribution;  RRD,  relative  radial  distribution. 

©  2016  Leshner  et  al.  This  article  is  distributed  by  The  American  Society  for  Cell 
Biology  under  license  from  the  author(s).  Two  months  after  publication  it  is  avail¬ 
able  to  the  public  under  an  Attribution-Noncommercial-Share  Alike  3.0  Unport¬ 
ed  Creative  Commons  License  (http://creativecommons.org/licenses/by-nc 
-sa/3.0). 

"ASCB®,"  "The  American  Society  for  Cell  Biology®,"  and  "Molecular  Biology  of 
the  Cell®"  are  registered  trademarks  of  The  American  Society  for  Cell  Biology. 


Although  the  molecular  mechanisms  that  govern  the  nuclear  posi¬ 
tion  of  a  gene  are  unknown,  changes  in  gene  positioning  are  some¬ 
times,  but  not  always,  correlated  with  functional  changes,  such  as 
altered  transcription  levels  or  replication  timing  or  with  epigenetic 
changes  to  chromatin  (Kosak  et  a/.,  2002;  Williams  et  a/.,  2006; 
Hiratani  et  al.,  2008;  Meaburn  and  Misteli,  2008;  Meaburn  et  a/., 
2016;  Takizawa  et  al.,  2008a;  Morey  et  al.,  2009;  Ferrai  et  al.,  2010; 
Peric-Hupkes  et  al.,  2010;  Towbin  et  al.,  2012;  Kind  et  al.,  2013; 
Therizols  et  al.,  2014;  Rafique  et  al.,  2015). 

There  is  accumulating  evidence  that  spatial  reorganization  of  the 
genome  is  a  pathological  feature,  with  repositioning  of  individual 
genes  or  genomic  regions  detected  in  a  wide  range  of  diseases, 
including  epilepsy  (Borden  and  Manuelidis,  1988),  laminopathies 
(Meaburn  et  al.,  2007;  Mewborn  et  al.,  2010;  Mehta  et  al.,  2011), 
parasitic  or  viral  infection  (Li  eta/.,  2010;  Knight  et  al.,  201 1 ;  Arican- 
Goktas  eta/.,  2014),  Down  syndrome  (Paz  eta/.,  2015),  endometrio¬ 
sis  (Mikelsaar  et  al.,  2014),  and  cancer  (Cremer  et  al.,  2003;  Murata 
et  al.,  2007;  Meaburn  and  Misteli,  2008;  Meaburn  eta/.,  2009;  Zeitz 
et  al.,  201 3).  For  example,  in  cultured  fibroblasts  from  laminopathy 
patients,  some  chromosomes,  such  as  chromosomes  (HSAs)  13  and 
18,  occupy  altered  nuclear  positions,  although  others,  including 
HSAX  and  HSA4,  are  unaffected  (Meaburn  et  al.,  2007;  Mewborn 
et  al.,  2010;  Mehta  et  al.,  201 1).  Similarly,  in  brain  tissue  from  epi¬ 
lepsy  patients,  the  centromere  of  HSAX  is  relocated  but  1  q  1 2,  9q1 2, 
and  Y12  are  not  (Borden  and  Manuelidis,  1988). 

Spatial  genome  reorganization  has  been  associated  with  cancer. 
HSA18  and  HSA19,  which  in  many  cell  types  are  peripherally  and 
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internally  located,  respectively,  broadly  maintained  their  positions  in 
cell  lines  derived  from  various  cancers,  including  colon  and  cervical 
cancer,  melanoma,  and  Hodgkin's  disease,  although  they  tend  to  be 
in  closer  proximity,  and  a  small  fraction  of  nuclei  have  inverted  the 
locations  of  these  two  chromosomes  (Boyle  et  a/.,  2001 ;  Cremer 
etai,  2003;  Meaburn  eta/.,  2007).  Similarly,  HSA19  is  also  shifted  to 
a  more  peripheral  position  in  tumors  of  a  subset  of  thyroid  cancer 
patients  (Murata  et  a/.,  2007).  Moreover,  HSA1 8  and  the  BCL2  gene 
are  shifted  to  a  more  peripheral  position  in  invasive  cervical  squa¬ 
mous  carcinoma  compared  with  the  apical,  but  not  basal,  layer  of 
nonneoplastic  squamous  epithelium  (Wiech  et  a/.,  2009).  In  both  an 
in  vitro  model  of  breast  cancer  and  breast  cancer  tissues,  -40%  of 
the  tested  genes  were  in  altered  locations  (Meaburn  and  Misteli, 
2008;  Meaburn  et  a/.,  2009).  Of  importance,  most  of  these  gene¬ 
repositioning  events  are  specific  to  the  oncogenic  state  and  do  not 
occur  in  benign  breast  disease  (Meaburn  et  a/.,  2009).  Not  only 
might  the  position  of  a  genomic  region  change  in  relation  to  the 
nuclear  edge  or  center  (radial  positioning)  in  cancer,  but  its  position 
relative  to  its  neighbors  might  also  be  affected.  For  instance,  the 
group  of  loci  that  are  in  close  spatial  proximity  to  the  IGFBP3  locus 
is  significantly  changed  in  breast  cancer  cell  lines  compared  with  a 
normal  control  (Zeitz  etal.,  2013). 

Spatial  genome  reorganization  in  cancer,  however,  is  not  a 
global  event.  The  positions  of  many  genomic  regions  remain  con¬ 
served  in  cancer  (Kozubek  et  a/.,  2002;  Parada  et  a/.,  2002;  Wiech 
et  a/.,  2005;  Meaburn  and  Misteli,  2008;  Meaburn  et  a/.,  2009; 
Timme  et  a/.,  2011).  For  example,  the  radial  position  of  HSA8  in 
pancreatic  cancer  (Timme  et  a/.,  201 1)  and  HSA10  in  most  thyroid 
cancers  (Murata  et  a/.,  2007)  is  similar  to  that  in  normal  tissue,  and 
the  position  of  the  majority  of  genes  analyzed  in  breast  cancer 
(Wiech  et  a/.,  2005;  Meaburn  and  Misteli,  2008;  Meaburn  et  a/., 
2009)  and  leukemia  (Kozubek  et  a/.,  2002)  are  unaffected.  Similarly, 
the  preferred  clustering  of  chromosomes  12,  14,  and  15  is  main¬ 
tained  in  a  mouse  lymphoma  cell  line  compared  with  cultured 
spleenocytes  (Parada  eta/.,  2002). 

Prostate  cancer  is  highly  prevalent.  In  the  United  States  alone, 
>220,000  new  cases  are  diagnosed  each  year,  with  one  in  seven 
men  developing  prostate  cancer  during  their  lifetime,  and  there 
are  >27,500  deaths  from  it  annually,  accounting  for  9%  of  all  male 
cancer  deaths  (Siegel  et  a/.,  201 5).  A  major  challenge  in  the  treat¬ 
ment  of  prostate  cancer  patients  is  the  difficulty  of  distinguishing 
indolent  from  aggressive  cancer.  As  a  result,  a  large  number  of 
patients  receive  unnecessary  treatment  for  cancers  that  will  likely 
remain  asymptomatic  during  their  lifetime  (Draisma  et  a/.,  2009; 
Cooperberg  et  a/.,  2010).  There  is  also  a  subset  of  patients  with 
high-risk  prostate  cancer  who  are  undertreated  and  are  not  receiv¬ 
ing  the  treatment  their  cancer  requires  to  ensure  disease-free  sur¬ 
vival  (Cooperberg  et  a/.,  2010).  Emerging  evidence  suggests  that 
changes  in  nuclear  architecture  features,  such  as  nuclear  shape  or 
global  levels  and  patterns  of  histone  modifications  and  nuclear 
lamin  proteins,  may  have  a  clinical  value  for  prostate  cancer  detec¬ 
tion  and  prognosis  (Veltri  and  Christudass,  2014).  However, 
whereas  spatial  (re)organization  of  the  genome  has  been  proposed 
to  have  diagnostic  potential  for  breast  cancer  (Meaburn  and 
Misteli,  2008;  Meaburn  et  a/.,  2009),  little  is  known  about  spatial 
genome  organization  in  prostate  cancer. 

We  performed  an  unbiased  screen  of  the  position  of  40  genes  in 
Formalin-fixed,  paraffin-  embedded  (FFPE)  human  prostate  tissues. 
Although  we  find  limited  variability  in  gene  positioning  patterns 
among  individuals  and  general  conservation  of  positioning  patterns 
in  cancer,  we  identified  three  genes  that  undergo  disease-specific 
repositioning  in  prostate  cancer. 


RESULTS 

Comprehensive  mapping  of  gene  positioning 
in  prostate  tissue 

Analogous  to  our  approach  with  regard  to  breast  cancer  (Meaburn 
etal.,  2009),  we  screened  the  positioning  patterns  of  40  genes  (Sup¬ 
plemental  Table  SI)  in  prostate  specimens  to  identify  genes  that 
reposition  in  prostate  cancer.  To  this  end,  to  map  comprehensively 
the  spatial  position  of  genes  in  normal  and  malignant  prostate  tis¬ 
sue,  we  combined  fluorescence  in  situ  hybridization  (FISH)  with 
quantitative  imaging  to  map  the  nuclear  positions  of  our  set  of 
genes  in  4-  to  5-pm-thick  FFPE  human  prostate  tissues  (Supplemen¬ 
tal  Table  S2).  The  panel  of  tissues  used  includes  30  prostate  carcino¬ 
mas,  five  hyperplasic  (nonmalignant  abnormality)  tissues,  and  29 
normal  or  normal  adjacent  to  tumor  (NAT)  tissues  (Supplemental 
Table  S2).  The  eight  genes  previously  identified  to  reposition  in 
breast  cancer  tissues  (Meaburn  et  a/.,  2009),  do  not  reposition  in 
prostate  (unpublished  data),  and  we  were  unable  to  identify  any  par¬ 
ticular  characteristics  of  repositioning  genes  in  breast  cancer  that 
predicted  repositioning  potential  (Meaburn  etal.,  2009).  Therefore 
we  screened  a  wide  range  of  genes  in  prostate  tissues,  which  in¬ 
cluded  many  of  the  most  important  genes  related  to  prostate  can¬ 
cer,  such  as  genes  reported  to  be  misregulated  in  prostate  cancer  or 
to  participate  in  translocations,  as  well  as  a  set  of  randomly  selected 
genes,  which  were  selected  independently  of  known  changes  in  ac¬ 
tivity  or  involvement  in  prostate  cancer  development  or  progres¬ 
sion.  The  test  set  included  genes  from  21  different  chromosomes 
(Supplemental  Table  SI).  To  map  the  nuclear  locations  of  individual 
genes  in  a  given  tissue,  we  determined  the  radial  position  of  each 
allele  in  typically  100-200  randomly  selected  epithelial  nuclei  on 
projections  of  three-dimensional  image  stacks  using  Euclidean  dis¬ 
tance  transform  (EDT)  after  automated  FISH  signal  detection,  as  pre¬ 
viously  described  (Meaburn  et  a/.,  2009;  see  Materials  and  Methods 
for  details).  For  each  candidate  gene,  the  positions,  normalized  to 
the  nuclear  radius  to  eliminate  nuclear  size  effects,  of  all  measured 
alleles  in  a  tissue  were  combined  and  represented  as  a  cumulative 
relative  radial  distribution  (RRD).  Statistical  difference  between  the 
cumulative  RRDs  of  different  samples  was  assessed  using  the  non- 
parametric  two-sample  one-dimensional  (ID)  Kolmogorov-Smirnov 
(KS)  test  as  previously  described  (Meaburn  etal.,  2009;  see  Materials 
and  Methods). 

To  screen  for  genes  that  specifically  reposition  in  prostate  cancer, 
we  used  a  previously  described  screening  strategy  (Meaburn  et  a/., 
2009).  Briefly,  gene  positions  were  first  compared  in  at  least  three 
normal  and  three  cancerous  prostate  tissues  (Table  1  and  Supple¬ 
mental  Figure  1A).  Genes  that  repositioned  in  the  majority  of  can¬ 
cers,  in  the  absence  of  a  large  variability  among  normal  tissues,  were 
then  screened  in  a  larger  number  of  tissues  (Table  1  and  Supplemen¬ 
tal  Figure  1  A).  Finally,  the  sensitivity  and  specificity  of  the  reposition¬ 
ing  events  of  the  top  hits  were  assessed  by  positioning  these  genes 
in  hyperplasic  tissues  and  determining  false-positive  and  false-neg¬ 
ative  rates  (Supplemental  Figure  1A;  see  later  discussion  for 
details). 

Conservation  of  positioning  patterns  in  morphologically 
normal  prostate  tissues 

It  is  unknown  how  similar  positioning  patterns  are  between  pros¬ 
tates  from  different  individuals,  yet  knowing  the  level  of  interindi¬ 
vidual  variability  in  the  positioning  pattern  of  a  given  gene  is  critical 
in  determining  whether  any  repositioning  detected  in  cancer  tissues 
is  specific  to  cancer  or  simply  a  product  of  interindividual  variability. 
Potential  variability  of  positioning  patterns  is  particularly  relevant  in 
the  prostate,  since  the  position  of  several  genes  has  been  shown  to 
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Gene 

Number  of 
tissues 

Percentage  (number)  of  SD 
cross-comparisons  among: 

Normal 

Cancer 

Individual 
normal  tissues 

Individual 
normal  and 
cancer  tissues 

AR 

7 

6 

9.5  (2/21) 

16.7  (7/42) 

BCL2 

4 

4 

0.0  (0/6) 

0.0  (0/16) 

BRCA2 

4 

3 

0.0  (0/6) 

0.0  (0/12) 

CCND1 

4 

4 

50.0  (3/6) 

37.5  (6/16) 

DCN 

5 

3 

20.0  (2/10) 

0.0  (0/15) 

EGFR 

6 

4 

0.0  (0/1 5) 

0.0  (0/24) 

ERG 

5 

3 

50.0  (5/10) 

40.0  (6/15) 

ESR2 

5 

4 

0.0  (0/10) 

0.0  (0/20) 

ETV1 

5 

3 

0.0  (0/10) 

0.0  (0/15) 

FGFR 1 

7 

5 

28.6  (6/21) 

22.9  (8/35) 

FGFR2 

6 

4 

33.3  (5/15) 

41.7  (10/24) 

FLU 

8 

14 

7.1  (2/28) 

60.7  (68/112) 

FOXA1 

4 

4 

0.0  (0/6) 

0.0(0/16) 

FUT4 

4 

3 

0.0  (0/6) 

0.0  (0/12) 

GREB 1 

4 

3 

16.7  (1/6) 

25.0  (3/12) 

HOXA9 

4 

3 

0.0  (0/6) 

8.3  (1/12) 

KLK3 

5 

4 

20.0  (2/10) 

30  (6/20) 

LMNA 

4 

3 

50.0  (3/6) 

16.7  (2/12) 

MATR3 

4 

3 

33.3  (2/6) 

0.0(0/12) 

MMP1 

3 

3 

33.3  (1/3) 

11.1  (1/9) 

MMP14 

3 

3 

33.3  (1/3) 

44.4  (4/9) 

MMP2 

8 

16 

7.1  (2/28) 

48.4  (62/128) 

MMP9 

10 

19 

17.8  (8/45) 

44.2  (84/190) 

NPM1 

3 

3 

33.3  (1/3) 

33.3  (3/9) 

NUMA1 

4 

4 

0.0  (0/6) 

12.5  (2/16) 

PADI4/6 

6 

6 

13.3  (2/15) 

5.6  (2/36) 

PTEN 

3 

3 

0.0  (0/3) 

22.2  (2/9) 

RAF1 

8 

5 

32.1  (9/28) 

30.0(12/40) 

SATB1 

5 

4 

20.0  (2/10) 

15.0  (3/20) 

SERPINB2 

4 

4 

16.7  (1/6) 

6.3  (1/16) 

SLC45A3 

5 

0 

0.0  (0/10) 

ND 

SP100 

5 

4 

0.0  (0/10) 

20.0  (4/20) 

SPDEF 

5 

4 

10.0  (1/10) 

5.0  (1/20) 

TGFB1 

4 

4 

50.0  (3/6) 

30.0(5/16) 

THBS1 

5 

4 

20.0  (2/10) 

30.0  (6/20) 

TIMP2 

6 

7 

53.3  (8/15) 

33.3  (14/42) 

TIMP3 

6 

4 

60.0  (9/15) 

58.3  (14/24) 

TMPRSS2 

6 

4 

60.0  (9/15) 

50.0  (12/24) 

VEGFA 

13 

7 

24.4  (19/78) 

47.3  (43/91) 

VIM 

5 

4 

70.0  (7/10) 

40.0  (8/20) 

SD,  significantly  different,  based  on  a  KS  test,  p  <  0.01 .  ND,  not  determined 
TABLE  1  Cross-comparisons  among  individual  tissues. 


be  sensitive  to  androgen  (Lin  et  al.,  2009;  Mani  et  a/.,  2009).  To  de¬ 
termine  the  degree  of  variability  in  gene  localization  among  indi¬ 
viduals,  we  compared  the  positioning  pattern  of  our  set  of  40  genes 
among  morphologically  normal  prostate  tissues.  For  most  of  the 
genes  analyzed,  the  radial  distribution  of  the  gene  among  normal 
prostate  tissues  was  similar  (Table  1,  Figures  1  and  2,  and  Supple¬ 
mental  Figures  SI  and  S2).  For  12  of  40  (30.0%)  genes,  the  RRDs 
were  statistically  identical  among  all  individuals  (p  >  0.01;  Table  1, 
Figures  1  and  2,  and  Supplemental  Figures  SI  and  S2).  An  addi¬ 
tional  20  genes  (50.0%),  although  not  identical  among  all  compari¬ 
sons,  were  very  similarly  positioned  among  most  individuals,  with 
<33.4%  of  cross-comparisons  being  statistically  significantly  differ¬ 
ent  (p  <  0.01 ;  Table  1 ,  Figures  1  and  2,  and  Supplemental  Figures  SI 
and  S2).  The  position  of  20%  of  genes  (8  of  40;  CCND1  [3  of  6  cross¬ 
comparisons],  ERG  [5  of  10],  LMNA  [3  of  6],  TGFBI  [3  of  6],  TIMP2 
[8  of  1 5],  TIMP3  [9  of  1 5],  TMPRSS2  [9  of  1 5]  and  VIM  [7 of  1 0])  were 
highly  variable  among  individuals,  with  50-70%  of  the  cross-com¬ 
parisons  among  normal  tissues  being  significantly  different  (Table  1 , 
Figures  1  and  2,  and  Supplemental  Figures  SI  and  S2). 

Because  the  position  of  a  gene  in  a  given  tissue  can  be  signifi¬ 
cantly  different  from  some  but  not  all  other  tissues,  as  an  additional 
measure  of  conservation  in  positioning  patterns  among  individuals, 
we  compared  the  RRD  of  each  gene  in  each  normal  tissue  to  a 
pooled  normal  distribution  (PND).  The  PND  is  used  as  a  standard¬ 
ized  "average"  position  of  a  gene  within  normal  tissues  and  was 
generated  by  combining  the  RRDs  from  all  normal  tissues  analyzed 
into  a  single  distribution  for  each  gene,  as  previously  described 
(Meaburn  eta/.,  2009;  see  Materials  and  Methods ).  Again  we  found 
a  high  level  of  similarity  among  individuals  (Table  2,  Figures  1  and  2, 
and  Supplemental  Figures  SI  and  S2).  For  26  of  the  40  (65.0%) 
genes,  no  individual's  normal  tissue  was  significantly  different  from 
the  PND,  and  an  additional  eight  (20.0%)  genes  had  only  a  single 
normal  tissue  significantly  different  to  the  PND  (7.7-25%  of  normal 
tissues;  Table  2,  Figures  1  and  2,  and  Supplemental  Figures  SI  and 
S2).  On  the  other  hand,  three  genes  (3  of  40;  7.5%) — FGFR1  (2  of  7 
normal  tissues),  TMPRSS2  (2  of  6),  and  RAF1  (3  of  8) — had  approxi¬ 
mately  one-third  (28.6-37.5%)  of  normal  tissues  significantly  differ¬ 
ent  from  the  PND,  and  an  additional  three  genes  (CCND1  [2  of  4], 
TIMP2  [3  of  6],  and  TIMP3  [4  of  6])  were  significantly  different  from 
the  PND  in  50.0-66.7%  of  normal  tissues  (Table  2  and  Supplemental 
Figures  SI  and  S2).  Taken  together,  these  data  demonstrate  that  the 
radial  positions  of  most  genes  are  conserved  among  prostate  tissue 
from  different  individuals.  However,  the  position  of  a  subset  of 
genes  is  variable  among  individuals. 

Identification  of  repositioned  genes  in  prostate  cancer 

Having  established  limited  variability  in  gene  positioning  among  in¬ 
dividuals,  we  screened  for  genes  that  reposition  in  prostate  cancer. 
We  compared  the  RRDs  of  39  genes  in  multiple  cancer  tissues  to 
individual  normal  tissues  and  to  the  PND  (Tables  1  and  2,  Figures  1 
and  2,  and  Supplemental  Figures  SI  and  S2).  The  position  of  nine 
genes  (23.1  %)  was  indistinguishable  in  all  cross-comparisons  among 
individual  normal  and  cancer  tissues,  and  an  additional  19  genes 
(48.7%)  had  only  limited  variability  between  normal  and  cancer  tis¬ 
sues,  with  5.0-33.3%  of  cross-comparisons  reaching  significance. 
Nine  genes  (23.1%;  CCND1  [6  of  16  comparisons],  ERG  [6  of  15], 
VIM  [43  of  91  ],  FGFR2  [1 0  of  24],  MMP9  [84  of  1 90],  MMP14  [4  of  9], 
VEGFA  [43  of  91  ],  MMP2  [62  of  1 28],  and  TMPRSS2  [1 2  of  24])  were 
differently  positioned  in  37.5-50.0%  of  cross-comparisons.  Two 
genes  (5.1%;  TIMP3  [14  of  24]  and  FLU  [68  of  112])  were  reposi¬ 
tioned  in  -60%  of  cross-comparisons  between  normal  and  malig¬ 
nant  prostate  tissues  (Table  1,  Figures  1  and  2,  and  Supplemental 
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FIGURE  1 :  Gene-specific  spatial  reorganization  of  the  genome  in  prostate  cancer.  (A,  B)  Gene  loci  were  detected  by 
FISH  in  FFPE  prostate  tissue  sections.  Blue,  4',6-diamidino-2-phenylindole  nuclear  counterstain.  Projected  image  stacks. 
(A)  EGFR  (red)  and  SPDEF  (green)  gene  loci,  in  a  malignant  prostate  tissue.  Bar,  10  pm.  (B)  FLU,  MMP9,  and  MMP2  gene 
loci  (red)  in  normal  and  cancer  tissues.  Bar,  5  pm.  (C)  Cumulative  RRDs  for  the  indicated  genes  in  prostate  cancer  (red), 
normal  prostate  tissues  (black),  and  the  pooled  normal  distribution  (PND,  blue).  The  positioning  patterns  of  some  but 
not  all  genes  are  different  in  prostate  cancer  compared  with  normal  tissues.  RRP,  relative  radial  position. 


Figures  SI  and  S2).  For  most  genes,  the  number  of  significantly  dif¬ 
ferent  cross-comparisons  between  normal  and  cancer  specimens 
was  not  greater  than  that  of  the  cross-comparisons  among  normal 
tissues,  suggesting  that  the  repositioning  was  not  cancer  specific 
(Table  1).  For  example,  whereas  TIMP3  was  differently  positioned  in 
58.3%  (14  of  24)  of  the  cross-comparisons  between  individual  nor¬ 


mal  and  cancer  tissues,  60.0%  (9  of  15)  of  cross-comparisons  were 
statistically  different  when  normal  tissues  were  cross-compared  with 
each  other  (Table  1  and  Supplemental  Figure  S2).  Taking  into  con¬ 
sideration  the  degree  of  positioning  changes  in  all  cross-compari¬ 
sons  and  the  interindividual  variability  of  candidates,  we  identified 
three  genes — MMP2,  MMP9,  and  FLU — that  exhibited  robust 
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FIGURE  2:  Gene-specific  repositioning  in  prostate  cancer.  Heat  maps  representing  the  pairwise 
statistical  comparisons  of  radial  positioning  patterns  of  indicated  genes  among  tissues  using  the 
two-sample  ID  KS  test.  Although  positioning  patterns  are  statistically  similar  (blue,  brown) 
among  most  normal  tissues  (N1-N28;  see  Supplemental  Table  S2),  they  can  be  divergent  (red, 
orange,  yellow)  in  cancer  tissues  (C1-C24;  see  Supplemental  Table  S2).  Black  and  white  asterisks 
indicate  a  cross-comparison  between  a  normal  and  cancer  specimens  from  the  same  individual. 
PND,  pooled  normal  distribution. 


differential  radial  positioning  between  nor¬ 
mal  and  cancer  tissues.  For  these  genes,  the 
percentage  of  cross-comparisons  between 
normal  and  cancer  tissues  was  higher  by 
>25%  than  when  normal  tissues  were  com¬ 
pared  with  each  other  (48.4  vs.  7.1%,  44.2 
vs.  17.8%,  and  60.7  vs.  7.1%,  respectively; 
Table  1  and  Figure  2). 

Repositioning  of  FL/7,  MMP9,  and  MMP2 
in  prostate  cancers  was  confirmed  by  analy¬ 
sis  of  their  distributions  in  individual  cancer 
samples  to  their  PNDs.  Most  genes  in  the 
test  set  displayed  limited  repositioning  in 
prostate  cancer  tissues  compared  with  their 
PNDs.  Two-thirds  (26)  of  genes  were  reposi¬ 
tioned  in  0-33.3%  of  cancer  tissues  com¬ 
pared  with  the  PND,  and  six  genes  reposi¬ 
tioned  in  40-50%  of  cancers  (Table  2, 
Figures  1  and  2,  and  Supplemental  Figures 
SI  and  S2).  As  observed  in  the  cross-com¬ 
parison  analysis,  FLU,  MMP9,  and  MMP2 
showed  robust  repositioning  in  prostate 
cancer,  and  their  distributions  were  distinct 
from  their  PNDs  in  92.9%  (13  of  14  of  cancer 
tissues),  68.4%  (13  of  19),  and  56.3%  (9  of 
16)  of  cancers,  respectively  (Table  2  and 
Figures  1  and  2).  Although  an  additional 
four  genes  were  also  repositioned  in  >50% 
of  cancers  compared  with  the  PND  (VEGF, 
57.1%  [4  of  7];  MMP14,  66.7%  [2  of  3]; 
NPM1,  66.7%  [2  of  3];  and  TMPRSS2,  75% 
[3  of  4];  Table  2  and  Supplemental  Figures 
SI  and  S2),  we  classified  these  genes  as 
low-priority  hits  and  did  not  pursue  them 
further  because  the  percentage  of  cross¬ 
comparisons  between  normal  and  cancer 
tissues  was  not  significantly  higher  than  the 
variability  detected  among  normal  tissues 
and  visual  inspection  of  the  cumulative  fre¬ 
quency  distribution  plots  revealed  that  only 
a  small  number  of  the  distributions  in  cancer 
tissues  fell  outside  the  range  of  the  normal 
tissues  (Supplemental  Figure  SI). 

In  the  vast  majority  (96  of  125;  76.8%)  of 
cases,  there  was  a  similar  repositioning  be¬ 
havior  when  a  cancer  was  compared  with  its 
matched  NAT  or  with  the  PND  (Figure  2  and 
Supplemental  Figure  S2).  This  observation 
suggests  that  repositioning,  if  present  in  a 
patient,  is  limited  to  the  cancer  tissue  itself 
and  that  comparison  to  the  PND  is  suitable 
to  assess  whether  a  gene  is  repositioned  in 
cancer,  even  when  no  normal  tissue  from 
the  same  individual  is  available. 

We  were  unable  to  define  any  feature  of 
a  prostate  cancer  that  predicted  the  likeli¬ 
hood  of  a  gene  repositioning  within  it.  The 
propensity  of  the  three  marker  genes  to  re¬ 
position  in  a  given  cancer  was  not  linked  to 
changes  in  copy  number  (Supplemental 
Table  S3),  nor  was  the  variation  in  their 
positioning  patterns  correlated  with  the 
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Percentage  (number)  of  SD 
cross-comparisons  between: 

Gene 

Individual  normal 
tissues  and  pooled 
normal 

Individual  cancer 
tissues  and  pooled 
normal 

AR 

0.0  (0/7) 

33.3  (2/6) 

BCL2 

0.0  (0/4) 

0.0  (0/4) 

BRCA2 

0.0  (0/4) 

33.3  (1/3) 

CCND1 

50.0  (2/4) 

25.0  (1/4) 

DCN 

0.0  (0/5) 

0.0  (0/3) 

EGFR 

0.0  (0/6) 

0.0  (0/4) 

ERG 

20.0(1/5) 

33.3  (1/3) 

ESR2 

0.0  (0/5) 

0.0  (0/4) 

ETV1 

0.0  (0/5) 

0.0  (0/3) 

FGFR1 

28.6  (2/7) 

40.0  (2/5) 

FGFR2 

16.7  (1/6) 

50.0  (2/4) 

FLI1 

12.5  (1/8) 

92.9  (13/14) 

FOXA1 

0.0  (0/4) 

0.0  (0/4) 

FUT4 

0.0  (0/4) 

0.0  (0/3) 

GREB1 

25.0  (1/4) 

0.0  (0/3) 

HOXA9 

0.0  (0/4) 

33.3  (1/3) 

KLK3 

20.0  (1/5) 

25.0  (1/4) 

LMNA 

0.0  (0/4) 

33.3  (1/3) 

MATR3 

0.0  (0/4) 

0.0  (0/3) 

MMP1 

0.0  (0/3) 

0.0  (0/3) 

MMP14 

0.0  (0/3) 

66.7  (2/3) 

MMP2 

0.0  (0/8) 

56.3  (9/16) 

MMP9 

10.0  (1/10) 

68.4  (13/19) 

NPM1 

0.0  (0/3) 

66.7  (2/3) 

NUMA1 

0.0  (0/4) 

50.0  (2/4) 

PADI4/6 

0.0  (0/6) 

16.7  (1/6) 

PTEN 

0.0  (0/3) 

0.0  (0/3) 

RAF1 

37.5  (3/8) 

40.0  (2/5) 

SATB1 

0.0  (0/5) 

25.0  (1/4) 

SERPINB2 

0.0  (0/4) 

0.0  (0/4) 

SLC45A3 

0.0  (0/5) 

ND 

SP100 

0.0  (0/5) 

25.0  (1/4) 

SPDEF 

0.0  (0/5) 

0.0  (0/4) 

TGFB1 

25.0(1/4) 

0.0  (0/4) 

THBS1 

0.0  (0/5) 

50.0  (2/4) 

TIMP2 

50.0  (3/6) 

28.6  (2/7) 

TIMP3 

66.7  (4/6) 

50.0  (2/4) 

TMPRSS2 

33.3  (2/6) 

75.0  (3/4) 

VEGFA 

7.7  (1/13) 

57.1  (4/7) 

VIM 

0.0  (0/5) 

25.0  (1/4) 

SD,  significantly  different,  based  on  a  KS  test,  p  <  0.01 .  ND,  not  determined. 

TABLE  2  Comparison  of  individual  tissues  to  a  pooled  normal 
distribution. 


RRP 


FIGURE  3:  Gene  repositioning  in  prostate  cancer  is  not  linked  to 
prognostic  status.  Cumulative  RRDs  are  color  coded  according  to  the 
prognostic  status  of  the  cancer:  blue,  indolent  prostate  cancers 
(Gleason  score  <6,  without  metastasis);  green,  intermediate  cancers 
(Gleason  score  7,  without  metastasis);  red,  aggressive  cancers 
(Gleason  score  >8,  and/or  metastasis).  The  pooled  normal  distribution 
(black)  is  included  for  comparison.  RRP,  relative  radial  position. 

clinicopathologically  defined  aggressiveness  of  the  cancer  (Supple¬ 
mental  Table  S2  and  Figure  3). 

Cancer  specificity  of  FLI1  and  MMP9  repositioning 

Given  that  loci  can  alter  their  nuclear  location  in  diseases  other  than 
cancer  (Borden  and  Manuelidis,  1988;  Meaburn  etal.,  2007,  2009), 
we  sought  to  determine  whether  the  repositioning  of  FLU,  MMP9, 
and  MMP2  was  specific  to  cancer  or  was  a  more  general  feature  of 
prostate  disease.  We  compared  the  radial  positioning  patterns  of 
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Gene 

Percentage  (number)  of  SD 
cross-comparison  among: 

Individual 

hyperplasia 

tissues 

Individual 
normal  and 
hyperplasia 
tissues 

Individual 
hyperplasia 
tissues  and 
pooled  normal 

FLU 

10.0  (1/10) 

1 0.0  (4/40) 

0.0  (0/5) 

MMP9 

0.0  (0/10) 

12.0  (6/50) 

0.0  (0/5) 

MMP2 

0.0(0/10) 

45.0  (18/40) 

80.0  (4/5) 

SD,  significantly  different,  based  on  a  two-sample  1 D  KS  test,  p  <  0.01 . 


TABLE  3  Comparison  of  gene  positioning  in  benign  tissues. 

these  genes  in  five  benign  hyperplastic  tissues  with  the  positioning 
patterns  of  normal  prostate  tissues  and  among  multiple  hyperplasia 
tissues  (Table  3  and  Figure  4).  MMP9  and  MMP2  were  positioned 
identically  in  all  hyperplastic  tissues  (Table  3  and  Figure  4).  The  posi¬ 
tion  of  FLH  is  similarly  conserved  among  hyperplastic  tissues,  with 


only  a  single  cross-comparison  reaching  significance  (1  of  10;  Table 
3  and  Figure  4).  FLI1  and  MMP9  did  not  reposition  in  hyperplasia 
compared  with  normal  tissues,  since  the  positioning  of  both  genes 
was  indistinguishable  from  the  PND  in  all  five  hyperplasia  tissues 
(Table  3  and  Figure  4),  and  only  10%  (4  of  40)  and  12%  (6  of  50)  of 
the  cross-comparisons  among  individual  normal  tissues  reached  sig¬ 
nificance,  respectively.  These  findings,  combined  with  the  low  repo¬ 
sitioning  rate  in  normal  tissue,  suggests  that  FLU  and  MMP9  reposi¬ 
tioning  is  specific  to  cancer.  In  contrast,  compared  with  normal 
tissues,  MMP2  was  repositioned  in  both  prostate  cancer  and  hyper¬ 
plasia.  MMP2  was  in  a  significantly  different  position  in  80%  (4  of  5) 
of  the  hyperplastic  tissues  compared  with  the  PND  and  in  45%  (18 
of  40)  of  cross-comparisons  between  hyperplasic  and  normal  tissues 
(Table  3  and  Figure  4),  suggesting  that  the  repositioning  of  MMP2  is 
common  in  prostate  disease  and  not  limited  to  malignancy.  We  had 
access  to  both  hyperplastic  and  cancer  tissue  from  the  same  pros¬ 
tate  (B1  and  C3,  respectively;  Supplemental  Table  S2).  In  keeping 
with  the  comparisons  between  hyperplasia  and  normal  tissue,  FLU 
and  MMP9  occupied  significantly  different  positions  in  the  cancer 
and  hyperplasic  tissue  (p  =  0.004  and  0.002,  respectively),  whereas 
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FIGURE  4:  Gene  positioning  in  benign  disease.  Positions  of  indicated  genes  were  compared  between  hyperplasia 
(B1-B5)  and  normal  prostate  tissue  (N1-N25).  (A)  Cumulative  RRDs  for  the  indicated  genes  in  hyperplasia  (red),  normal 
tissues  (black),  and  the  pooled  normal  distribution  (PND;  blue).  (B)  Pairwise  statistical  comparisons  of  RRDs  between 
hyperplasic  tissues  and  normal  tissues  and  among  hyperplasic  tissues,  using  the  two-sample  1 D  KS  test.  All  three  genes 
are  similarly  positioned  between  hyperplasic  tissues  and  only  MMP2  repositions  in  hyperplasic  compared  with  normal 
tissues.  RRP,  relative  radial  position. 
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Gene 

Normal-tissue 
false  positives 

Benign-tissue 
false  positives 

Total  false 
positives 

FLU 

12.5%  (1/8) 

0.0%  (0/5) 

7.7%  (1/13) 

MMP9 

10.0%  (1/10) 

0.0%  (0/5) 

6.7%  (1/15) 

MMP2 

0.0%  (0/8) 

80.0%  (4/5) 

30.8%  (4/13) 

The  percentage  (number)  of  tissues  that  give  a  false-positive  result.  A  false 
positive  is  scored  when  a  gene  has  a  statistically  significant  different  RRD  in  a 
normal  or  hyperplastic  tissue  compared  with  the  pooled  normal  (p  <  0.01 ;  KS 
test). 

TABLE  4:  False-positive  rates. 

MMP2  was  in  a  similar  position  in  the  two  disease  states  (p  =  0.516) 
for  this  individual. 

High-confidence  detection  of  prostate  cancers  using 
multiplexed  positioning  biomarkers 

Because  FLU  and  MMP9  repositioning  occurs  predominantly  in  can¬ 
cer  tissues,  we  determined  their  false-positive  and  false-negative 
rates  as  a  preliminary  step  toward  assessing  their  potential  as  bio¬ 
markers  for  prostate  cancer.  A  false  positive  was  defined  as  reposi¬ 
tioning  (p  >  0.01;  KS  test)  of  the  marker  gene  in  a  nonmalignant 
(normal  or  hyperplastic)  tissue  compared  with  the  PND,  thus  incor¬ 
rectly  classifying  the  tissue  as  cancer.  Conversely,  a  false  negative 
was  defined  as  lack  of  repositioning  (p  <  0.01 )  of  the  marker  gene  in 
a  cancer,  leading  to  incorrect  classification  of  the  tissue  as  nonmalig¬ 
nant.  As  expected  due  to  its  high  rate  of  repositioning  in  hyperplasia, 
MMP2  had  a  high  false-positive  rate  (30.8%  [4  of  13  nonmalignant 
tissues];  Figures  2  and  4  and  Table  4).  This  finding,  combined  with  its 
high  false-negative  rate  (43.8%  [7  of  16  cancer  tissues];  Figure  2  and 
Table  5),  suggests  that  MMP2  would  not  be  a  useful  biomarker  for 
prostate  cancer.  In  contrast,  FLU  had  low  false-negative  and  false¬ 
positive  rates  (7.1%  [1  of  1 4  cancer  tissues]  and  7.7%  [1  of13nonma- 
lignant  tissues],  respectively;  Figures  2  and  4  and  Tables  4  and  5). 
MMP9  had  a  fairly  high  false-negative  rate  (31.6%  [6  of  19  cancer 
tissues];  Figure  2  and  Table  5)  but  a  low  false-positive  rate  (6.7%  [1  of 
1 5  nonmalignant  tissues];  Figures  2  and  4  and  Table  4). 

None  of  the  genes  we  tested  repositioned  in  all  prostate  cancer 
tissues.  However,  at  least  one  repositioned  gene  was  present  in  ev¬ 
ery  prostate  cancer  tissue  analyzed  (30  of  30  cancers;  Figure  2  and 
Supplemental  Figure  S2).  Given  the  low  false-positive  rate  of  FLU 
and  MMP9,  we  explored  the  possibility  of  using  these  two  genes  in 
a  multiplex  format  for  the  detection  of  cancer  samples.  When  ana¬ 
lyzed  together,  FLU  or  MMP9,  or  both,  repositioned  in  all  (11  of  1 1) 
cancers  (Figure  2  and  Supplemental  Table  S4).  Consequently,  multi¬ 
plexing  FL/7  and  MMP9  reduced  the  false-negative  rate  to  0%  (0  of 
1 1 ;  Table  5).  We  conclude  that  in  combination,  FL/7  and  MMP9  are 
strong  potential  biomarkers  for  prostate  cancer. 

DISCUSSION 

This  study  was  designed  to  identify  gene  loci  that  undergo  reposi¬ 
tioning  during  prostate  carcinogenesis.  Spatial  mapping  of  the  ra¬ 
dial  position  of  40  genes  by  FISH  identified  two  genes  that  robustly 
reposition  in  prostate  cancer  and  a  third  gene  that  repositions  in 
both  prostate  cancer  and  benign  prostate  disease.  The  vast  majority 
of  analyzed  genes,  however,  did  not  undergo  significant  cancer- 
specific  repositioning,  pointing  to  considerable  conservation  of  spa¬ 
tial  genome  organization  in  prostate  cancer. 

In  keeping  with  previous  studies  on  positioning  of  select  genes 
and  of  whole  chromosomes  in  brain  (Borden  and  Manuelidis,  1988), 
pancreas  (Timme  et  a/.,  2011),  thyroid  (Murata  et  al.,  2007),  and 


Gene(s) 

False-negative  rate 

Single  gene 

FL/7 

7.1%  (1/14) 

MMP9 

31.6%  (6/19) 

MMP2 

43.8%  (7/16) 

Multiplexed  genes 

FL/7  and  MMP9 

0.0%  (0/11) 

FLU  and  MMP2 

9.1%  (1/11) 

MMP9  and  MMP2 

28.6%  (4/14) 

The  percentage  (number)  of  tissues  that  give  a  false-negative  result.  A  false 
negative  is  scored  when  the  RRD  of  a  gene  in  a  cancer  tissue  is  similar  to  that 
of  the  pooled  normal  distribution  (KS  test,  p  >  0.01).  For  multiplexed  genes,  a 
false  positive  is  scored  when  neither  gene  in  the  pair  is  repositioned  compared 
with  the  pooled  normal  distribution  (p  >  0.01 ,  KS  test).  Only  tissues  in  which 
both  genes  of  the  pair  were  positioned  are  included. 

TABLE  5:  False-negative  rates  for  single  and  multiplexed  genes. 


breast  (Wiech  et  al.,  2005;  Meaburn  et  al.,  2009),  we  find  that  the 
spatial  positioning  of  most  genes  tested  in  normal  prostate  tissue  is 
generally  well  conserved  among  individuals.  Of  40  genes  analyzed, 
only  three  (CCND1,  TIMP2,  and  TIMP3 )  were  highly  variable  among 
individuals.  A  prerequisite  of  a  useful  cancer  biomarker  is  conserva¬ 
tion  among  individuals  in  nonmalignant  tissues  because  such  vari¬ 
ability  leads  to  low  sensitivity  and  specificity  of  the  marker.  There¬ 
fore  some  genes  will  not  be  suitable  for  diagnostic  purposes  due  to 
variable  positioning  patterns  among  individuals.  The  variability  in 
the  position  of  some  genes  in  normal  prostate  tissue  is  higher  than 
previously  detected  for  15  genes  in  morphologically  normal  breast 
tissue  (Meaburn  et  al.,  2009).  The  reasons  for  this  difference  be¬ 
tween  the  two  tissue  types  are  not  known.  One  possibility  is  that 
some  glands  within  normal  prostate  tissues  have  preneoplastic  fea¬ 
tures.  For  example,  proliferative  inflammatory  atrophy,  which  is  re¬ 
garded  as  a  histologically  distinctive  pattern  of  benign  glands,  has 
some  of  the  molecular  features  of  prostate  neoplasia  (De  Marzo 
eta/.,  1999).  In  contrast,  there  is  no  reported  counterpart  lesion  of 
mammary  glands.  Another  possibility  is  that  these  prostate  glands 
could  have  included  low-grade  prostatic  intraepithelial  neoplasia, 
which  is  challenging  to  recognize  in  dark-field  FISH  preparations.  In 
addition,  some  genes  could  be  more  susceptible  to  interindividual 
variations  in  their  positioning  patterns,  which  could  be  either  a  gen¬ 
eral  feature  of  the  locus  or  tissue  specific.  For  example,  the  position 
of  CCND1  was  similar  among  normal  breast  tissues  (Meaburn  eta/., 
2009)  but  varied  among  individuals  in  the  prostate,  pointing  to  a 
potential  tissue-specific  driver  to  interindividual  positioning  pat¬ 
terns.  Because  CCND1  is  a  cell  cycle  regulator  and  epithelial  cells 
are  largely  quiescent  in  normal  breast  and  prostate  tissues,  a  direct 
link  between  CCND1  function  and  tissue  specificity  seems  unlikely. 

We  identified  three  genes  that  consistently  reposition  in  prostate 
cancer.  The  repositioning  of  FL/7  and  MMP9  was  specific  to  cancer, 
as  it  did  not  occur  in  benign  disease,  in  line  with  earlier  observations 
in  breast,  where  gene  repositioning  was  generally  limited  to  cancer¬ 
ous  tissue  (Meaburn  et  al.,  2009).  In  contrast,  MMP2  repositioned  in 
cancer  as  well  as  in  hyperplasic  prostate  tissue.  Our  findings  are  in 
line  with  previous  studies  in  other  types  of  cancer,  for  which  reposi¬ 
tioning  of  some  loci  and  concurrent  conservation  in  the  positioning 
patterns  of  others  have  been  reported  (Croft  et  al.,  1999;  Parada 
et  al.,  2002;  Cremer  et  al.,  2003;  Wiech  et  al.,  2005,  2009;  Murata 
et  al.,  2007;  Meaburn  and  Misteli,  2008;  Meaburn  et  al.,  2009; 
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Harewood  etal.,  2010).  Of  interest,  FLU  and  MMP9  also  reposition 
in  breast  cancer  (unpublished  data),  suggesting  that  they  are  multi¬ 
tissue  markers  of  cancer.  Not  all  genes  that  reposition  in  cancer  are 
general  markers  of  a  malignant  state,  however,  since  an  additional 
eight  genes  that  reposition  in  breast  cancer  (Meaburn  et  a/.,  2009) 
do  not  reposition  in  prostate  cancer  (unpublished  data). 

We  did  not  identify  any  common  features  shared  among  reposi¬ 
tioning  genes  in  prostate  cancer.  For  example,  the  genes  that  repo¬ 
sition  map  to  different  chromosomes  (FLI 7,  HSA1 1 ;  MMP9,  HSA20; 
and  MMP2,  HSA16).  The  repositioning  of  FLI 7,  at  least,  is  unlikely  to 
reflect  whole-chromosome  movements,  since  of  five  genes  located 
on  HSA11  ( FLU ,  CCND1,  FUT4,  MMP1,  and  NUMA1),  only  FL/7  ro¬ 
bustly  repositioned  in  prostate  cancer.  In  addition,  biological  func¬ 
tion  or  known  relevance  to  tumorigenesis  of  genes  does  not  predict 
repositioning  behavior.  Many  of  the  genes  we  positioned  that  are 
commonly  implicated  in  prostate  cancer,  such  as  KLK3  (prostate- 
specific  antigen),  AR,  PTEN,  TGFB1,  VEGFA,  and  BCL2  (Bok  and 
Small,  2002;  Ferte  et  a/.,  2010;  Lawrence  et  a/.,  2010;  Shen  and 
Abate-Shen,  2010),  and  the  commonly  translocated  genes  in  pros¬ 
tate  cancer,  the  androgen-regulated  TMPRSS2  and  the  ETS  tran¬ 
scription  factors  ERG  and  ETV7  (Tomlins  et  a/.,  2005),  do  not  spa¬ 
tially  reposition  in  prostate  cancers.  Yet  the  ETS  transcription  factor 
FL/7,  which  forms  rare  translocations  in  prostate  cancer  (Paulo  et  a/., 
2012),  does.  MMP9  and  MMP2,  which  belong  to  the  same  class  of 
matrix  metallopeptidases  (MMPs)  and  are  involved  in  the  break¬ 
down  of  extracellular  matrix,  which  is  important  to  cancer-  associ¬ 
ated  processes  such  as  angiogenesis  and  metastasis  (Egeblad  and 
Werb,  2002),  reposition  in  prostate  cancer,  and  yet  the  potential 
functional  relevance  of  the  cancer-association  gene  repositioning  is 
unclear.  Although  an  increase  in  MMP9  and  MMP2  expression  has 
been  linked  to  poor  prognosis  in  prostate  cancer,  both  genes  are 
predominantly  expressed  by  the  stromal  cells  and  not  by  the  tumor 
cells  (Egeblad  and  Werb,  2002).  Other  MMP  family  members,  such 
as  M/V7P74,  and  the  genomic  loci  containing  a  cluster  of  MMP 
genes,  including  MMP1,  did  not  reposition.  Moreover,  genes  cod¬ 
ing  the  MMP-regulating  proteins  TIMP2,  TIMP3  (Egeblad  and  Werb, 
2002),  and  SPDEF  (Johnson  etal.,  2010),  another  ETS  transcription 
factor,  do  not  reposition  in  prostate  cancer,  further  reducing  a  link 
between  function  and  cancer-associated  gene  repositioning.  A  lack 
of  a  link  between  gene  function  and  an  altered  radial  position  in 
cancer  is  in  line  with  multiple  studies  that  find  no  correlation  be¬ 
tween  changes  in  gene  expression  and  changes  in  radial  positioning 
patterns,  including  a  cell  culture  model  of  early  breast  cancer 
(Williams  et  a/.,  2006;  Meaburn  and  Misteli,  2008;  Takizawa  et  a/., 
2008b;  Morey  et  a/.,  2009;  Harewood  et  a/.,  2010;  Shachar  et  a/., 
2015).  Although  we  focused  on  radial  positioning  patterns  to  iden¬ 
tify  cancer-specific  spatial  genome-repositioning  events,  relative 
positioning  of  loci  to  other  genomic  loci  or  nuclear  landmarks,  such 
as  nuclear  bodies,  may  be  a  promising  alternative  parameter  to 
characterize  the  spatial  reorganization  of  gene  loci  in  cancer. 

Repositioning  did  not  correlate  with  gene  copy  number  or  the 
presence  of  translocations.  For  example,  FL/7  repositioned  in  both 
cancers  with  and  without  amplified  FL/7,  and  we  found  FL/7  reposi¬ 
tioning  in  13  of  14  prostate  cancers,  yet  the  SLC45A3-FL/7  translo¬ 
cation  occurs  in  only  1  in  200  prostate  cancers  (Paulo  et  a/.,  2012). 
Although  it  is  possible  that  translocations  could  affect  the  position 
of  genomic  regions  beyond  the  translocating  region,  even  on  chro¬ 
mosomes  not  involved  in  the  translocation,  our  data  demonstrate 
that  in  the  background  of  genetic  instability,  it  is  still  possible  to  use 
gene  positioning  for  cancer  diagnostics. 

As  previously  observed  for  breast  cancer  (Meaburn  et  a/.,  2009), 
we  find  that  combinatorial  analysis  of  multiple  genes  results  in 


detection  of  cancer  with  high  accuracy.  Whereas  neither  MMP9  nor 
FL/7  alone  was  repositioned  in  all  prostate  tumor  samples,  at  least 
one  of  the  two  was  repositioned  in  all  cancer  samples  analyzed.  Our 
observation  of  cancer-specific  repositioning  of  these  two  genes, 
combined  with  their  relatively  low  false-positive  detection  rate, 
makes  MMP9  and  FL/7  strong  candidates  for  combinatorial  use  in 
cancer  diagnosis. 

MATERIALS  AND  METHODS 

Tissues 

To  detect  individual  genes,  FISH  was  performed  on  a  panel  of  4-  to 
5-pm-thick  FFPE  human  prostate  tissue  sections,  which  consisted  of 
30  prostate  cancers,  29  normal  and  NAT  prostate  tissues,  and  five 
hyperplasic  tissues  (Supplemental  Table  S2).  Prostate  tissues  were  ob¬ 
tained  from  the  University  of  Washington  (Seattle,  WA)  under  the 
guidelines  and  approval  of  the  Institutional  Review  Board  of  the 
University  of  Washington  (#00-3449).  These  specimens  were  reviewed 
by  a  single  genitourinary  pathologist  (L.D.T.).  Additional  tissues  were 
purchased  from  US  Biomax  (Rockville,  MD)  and  Imgenex  (San  Diego, 
CA;  Supplemental  Table  S2).  All  specimens  were  deidentified. 

FISH 

FISH  probes  were  produced  by  using  nick  translation  to  label  DNA 
purified  from  bacterial  artificial  chromosome  clones  (BACPAC 
Resources  Center  [Oakland,  CA];  Supplemental  Table  SI)  with  either 
biotin-  or  digoxigenin-conjugated  dUTPs  (Roche,  Indianapolis,  IN), 
as  detailed  in  Meaburn  (2010).  An  identical  dual-probe  FISH  proce¬ 
dure  was  performed  on  all  specimens  as  previously  described 
(Meaburn  et  a/.,  2009),  with  the  exceptions  that  1)  the  60°C  slide 
baking  step,  before  xylene  (Avantor  Performance  Materials,  Center 
Valley,  PA)  treatment,  was  omitted,  2)  the  time  the  tissue  sections 
were  incubated  in  0.25  mg/ml  Proteinase  K  (Sigma-Aldrich,  St. 
Louis,  MO)  was  increased  to  15-20  min,  and  3)  0.5  mg/ml  Protein¬ 
ase  K  was  often  required  for  single-tissues  slides  obtained  from  Bio¬ 
max  (but  not  tissue  microarray  slides). 

Image  acquisition  and  FISH  analysis 

Image  accusation  was  performed  essentially  as  described  in 
Meaburn  et  a/.  (2009).  Briefly,  tissue  sections  were  imaged  with  an 
1X70  (Olympus,  Waltham,  MA)  DeltaVision  (Applied  Precision-GE 
Healthcare  Bio-Sciences,  Pittsburgh,  PA)  system,  using  a  60x/1.42 
numerical  aperture  oil  objective  lens  (Olympus)  and  an  auxiliary 
magnification  of  1 .5.  Image  stacks  were  acquired  with  a  0.5-pm  step 
interval  along  the  z-axis.  If  mixed  morphologies  were  present  in  a 
tissue  section,  before  imaging,  the  tissue  was  visually  inspected  at 
low  resolution  (lOx  lens;  Olympus)  to  identify  malignant,  normal,  or 
hyperplastic  regions.  In  these  cases,  hematoxylin  and  eosin-stained 
tissue  slides  that  had  been  marked  by  a  pathologist  (L.D.T.)  were 
used  as  a  guide  to  ensure  correct  classification  of  different  regions 
of  a  tissue.  Image  stacks  were  deconvolved  and  converted  into  max¬ 
imum  intensity  projections  using  SoftWoRx  (Applied  Precision). 

Image  analysis  was  performed  as  previously  described  (Meaburn 
et  a!.,  2009).  Images  were  contrast  enhanced  based  on  visual  inspec¬ 
tion,  and  individual  nuclei  were  manually  delineated  in  Photoshop 
7.0  (Adobe,  San  Jose,  CA),  with  each  nucleus  saved  in  a  separate 
image  file.  To  determine  the  relative  radial  positioning  of  a  gene  in  a 
given  tissue,  91-263  randomly  selected  interphase  epithelial  nuclei 
were  run  though  a  custom-made  image  analysis  software  package 
(P.  Gudla  and  S.  Lockett,  National  Cancer  Institute,  Bethesda,  MD; 
Meaburn  etal.,  2009)  in  MATLAB  (MathWorks,  Natick,  MA)  using  the 
PRTools  and  DIPImage  toolboxes  (Delft  University  of  Technology, 
Delft,  Netherlands).  The  software  automatically  identified  FISH 
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signals  (>99%  of  FISH  signals  detected,  with  a  false-positive  rate  of 
<1%;  Takizawa  eta/.,  2008a)  and  determined  both  the  binary  EDT of 
the  geometric  gravity  center  of  each  FISH  signal  and  the  number  of 
FISH  signals  in  a  nucleus  for  both  genes  visualized  by  FISH.  In  EDT, 
each  pixel  in  a  nucleus  is  assigned  a  value  that  equals  the  shortest 
distance  to  the  nuclear  edge.  EDT  values  are  then  normalized  to  the 
maximum  EDT  value  of  a  given  nucleus  to  account  for  variations  in 
nuclear  size.  Using  this  method,  no  assumption  regarding  nuclear 
shape  or  size  is  made  when  determining  the  radial  position  of  a 
gene,  allowing  accurate  comparisons  between  tissues  even  if  there 
are  differences  or  irregularities  in  nuclear  shape  or  size.  The  relative 
radial  positions  of  individual  nuclei  from  the  same  specimen  for  each 
histologic  entity  were  combined  to  generate  a  RRD  for  a  gene.  All 
alleles  in  a  nucleus  were  included  for  analysis,  and  nuclei  were  ana¬ 
lyzed  regardless  of  the  number  of  alleles  present,  unless  there  were 
no  signals  for  either  gene.  Because  some  nuclei  contained  FISH  sig¬ 
nals  for  only  one  gene,  when  the  number  of  nuclei  analyzed  is  indi¬ 
cated,  only  nuclei  containing  FISH  signals  for  the  indicated  gene  are 
included.  To  generate  PNDs,  normalized  EDTs  from  all  nuclei  from  all 
normal  tissues  analyzed  for  a  given  gene  were  combined  into  a  sin¬ 
gle  data  set  (see  Table  1  for  the  number  of  normal  tissues  analyzed 
for  each  gene  and  Supplemental  Table  SI  for  the  number  of  nuclei 
used  to  generate  each  gene's  PND).  Finally,  cumulative  RRDs  from 
different  samples  were  statistically  compared  using  the  nonparamet- 
ric  two-sample  1 D  KS  test.  The  null  hypothesis  that  the  samples  are 
from  the  same  distribution  was  rejected  if  p  <  0.01 . 
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Abstract  Genes  have  preferential  non-random  spatial 
positions  within  the  cell  nucleus.  The  nuclear  position  of 
a  subset  of  genes  differ  between  cell  types  and  some  genes 
undergo  repositioning  events  in  disease,  including  cancer. 
It  is  currently  unclear  whether  the  propensity  of  a  gene  to 
reposition  reflects  an  intrinsic  property  of  the  locus  or  the 
tissue.  Using  quantitative  FISH  analysis  of  a  set  of  genes 
which  reposition  in  cancer,  we  test  here  the  tissue  specific¬ 
ity  of  gene  repositioning  in  normal  and  malignant  breast 
or  prostate  tissues.  We  find  tissue-specific  organization  of 
the  genome  in  normal  breast  and  prostate  with  40  %  of 
genes  occupying  differential  positions  between  the  two  tis¬ 
sue  types.  While  we  demonstrate  limited  overlap  between 
gene  sets  that  repositioned  in  breast  and  prostate  cancer,  we 
identify  two  genes  that  undergo  disease-related  gene  repo¬ 
sitioning  in  both  cancer  types.  Our  findings  indicate  that 
gene  repositioning  in  cancer  is  tissue-of-origin  specific. 

Keywords  Nuclear  architecture  •  Spatial  gene 
positioning  •  Breast  cancer  •  Prostate  cancer  •  Tissue 
specificity 
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Abbreviations 

BAC  Bacterial  artificial  chromosome 

EDT  Euclidean  distance  transform 

FFPE  Formalin-fixed,  paraffin  embedded 
GPBs  Gene  positioning  biomarkers 
HSA  Human  chromosome 

KS  test  Kolmogorov-Smirnov  test 

MMU  Mouse  chromosome 

NAT  Normal  adjacent  to  tumor 

PND  Pooled  normal  distribution 

RRD  Relative  radial  distribution 

TMA  Tissue  microarray 

Introduction 

The  genome  is  non-randomly  organized  within  the  inter¬ 
phase  nucleus  (Ferrai  et  al.  2010;  Meaburn  et  al.  2016). 
There  is  a  general  tendency  for  gene-rich  and  gene-poor 
genomic  regions,  ranging  from  whole  chromosomes  to 
chromatin  domains,  to  be  spatially  separate  from  each 
other  (Boutanaev  et  al.  2005;  Boyle  et  al.  2001;  Croft  et  al. 
1999;  Guelen  et  al.  2008;  Lieberman-Aiden  et  al.  2009; 
Shopland  et  al.  2006).  While  this  trend  is  maintained  across 
species  and  between  cell  types,  the  preferred  nuclear  loca¬ 
tions  of  specific  chromosomes  and  genes  can  be  variable 
depending  on  cellular  context.  Spatial  reorganization  of 
the  genome  occurs,  for  example,  with  changes  in  prolifera¬ 
tion  status,  during  differentiation,  between  cell  types  and  in 
disease  (Ferrai  et  al.  2010;  Meaburn  et  al.  2016).  It  is  cur¬ 
rently  unclear  how  the  spatial  arrangements  of  the  genome 
are  established  and  maintained,  although  epigenetic  modi¬ 
fications,  chromatin  remodeling,  gene  expression  and  rep¬ 
lication  timing  have  been  implicated  in  the  positioning  of 
some  genes  (Ferrai  et  al.  2010;  Hiratani  et  al.  2008;  Kosak 
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et  al.  2002;  Meabum  et  al.  2016;  Peric-Hupkes  et  al.  2010; 
Therizols  et  al.  2014;  Towbin  et  al.  2012;  Volpi  et  al.  2000; 
Williams  et  al.  2002). 

The  organization  of  the  genome  between  terminally 
differentiated  cell  types  is  remarkably  similar  despite  pro¬ 
found  differences  in  function,  gene  expression  profiles  and 
nuclear  shape.  For  instance,  most  human  chromosomes 
(HSAs)  occupy  similar  radial  positions  in  lymphoblast  and 
dermal  fibroblast  nuclei,  with  only  HSA  8,  HSA  20  and 
HSA  21  in  different  positions  (Boyle  et  al.  2001;  Meaburn 
et  al.  2008).  This  phenomenon  is  not  limited  to  humans,  as 
there  is  also  a  high  level  of  conservation  in  chromosomal 
positioning  patterns  between  porcine  lymphoblasts  and 
fibroblasts,  where  chromosome  17  is  the  only  differentially 
positioned  chromosome  (Foster  et  al.  2012).  In  mice,  three 
of  six  chromosomes  analyzed  position  similarly  between 
lymphoblasts  and  fibroblasts  (Mayer  et  al.  2005).  Compari¬ 
sons  of  positioning  patterns  of  chromosomes  between  other 
cell  types  are  less  extensive;  however,  there  is  conserva¬ 
tion  in  positioning  patterns  for  at  least  some  chromosomes. 
The  positioning  of  HSA  18  and  HSA  19  is  broadly  main¬ 
tained  in  a  wide  range  of  cell  types,  including  fibroblasts, 
keratinocytes,  lymphocytes  and  epithelial  cells  from  multi¬ 
ple  tissues  (Boyle  et  al.  2001;  Cremer  et  al.  2003;  Murata 
et  al.  2007).  Moreover,  porcine  chromosome  17,  13,  5  and 
X  are  similarly  positioned  between  kidney,  lung  and  brain 
tissue  (Foster  et  al.  2012)  and  mouse  chromosome  (MMU) 
14  is  similarly  positioned  in  lung  and  kidney  cells  (Parada 
et  al.  2004).  However,  tissue-specific  positioning  of  specific 
chromosomes  does  also  occur.  All  six  mouse  chromosomes 
analyzed  in  freshly  isolated  small  and  large  lung  cells,  liver, 
kidney,  lymphocytes  and  myeloblasts  cells  are  in  differential 
positions  in  at  least  three  tissue  types  (Parada  et  al.  2004). 
For  example,  MMU  12  and  MMU  15  are  in  distinct  posi¬ 
tions  in  lung  and  kidney  cells,  yet  are  in  similar  positions  in 
lymphocytes  and  myeloblasts  (Parada  et  al.  2004).  Moreo¬ 
ver,  MMU  5  is  the  only  of  several  analyzed  chromosome  in 
a  different  spatial  position  between  lymphocytes  and  myelo¬ 
blasts,  whereas  MMU  1,  MMU  5,  MMU  6  and  MMU  12 
are  differentially  positioned  between  liver  and  myeloblasts 
(Parada  et  al.  2004).  These  observations  point  to  partial  con¬ 
servation  of  positioning  patterns  among  tissue  types. 

Similar,  yet  distinct,  spatial  arrangement  of  the  genome 
between  differentiated  cell  types  is  also  consistent  with 
genome-wide  analyses  (Battulin  et  al.  2015;  Peric-Hupkes 
et  al.  2010).  Approximately  80  %  of  genomic  regions  that 
associate  with  the  nuclear  lamina  protein  lamin  Bl,  used 
as  a  marker  of  the  nuclear  periphery,  in  in  vitro  differen¬ 
tiated  mouse  astrocytes  also  preferentially  associate  with 
the  nuclear  lamina  in  embryonic  fibroblasts  (Peric-Hupkes 
et  al.  2010).  Moreover,  despite  the  highly  compact  and 
haploid  nature  of  DNA  in  sperm,  the  genome-wide  spa¬ 
tial  organization  of  DNA  is  similar  in  mouse  sperm  and 


fibroblast  nuclei  when  analyzed  by  Hi-C  genome-wide 
crosslinking  methods  (Battulin  et  al.  2015).  The  organi¬ 
zation  is,  however,  not  fully  identical.  For  example,  there 
is  a  higher  frequency  of  long-range  interactions  in  sperm 
compared  to  fibroblasts  (Battulin  et  al.  2015).  Addition¬ 
ally,  in  other  cell  types,  some  gene  loci  have  tissue-specific 
positions.  For  example,  human  CFTR  is  more  peripher¬ 
ally  positioned  in  lymphocytes  and  embryonic  kidney 
cells  compared  to  nasal  epithelial  cells,  and  CORTBP2  is 
more  peripherally  positioned  in  embryonic  kidney  cells 
compared  to  lymphocytes  (Zink  et  al.  2004).  Furthermore, 
the  spatial  position  of  a  gene  with  respect  to  the  chromo¬ 
some  it  resides  on  may  also  vary  between  cell  types.  Some 
gene-rich  clusters  of  functionally  related  genes,  such  as  the 
epidermal  differentiation  complex  or  the  major  histocom¬ 
patibility  complex,  loop  out  from  the  bulk  of  their  chromo¬ 
some  territory  more  frequently  in  cell  types  where  they  are 
highly  expressed  (Volpi  et  al.  2000;  Williams  et  al.  2002). 
Other  genes,  such  as  PAX6,  WT1,  DMD,  FLNA  and  BCL2 , 
however,  remain  within  the  chromosome  territory  in  dif¬ 
ferent  cell  types,  independent  of  transcription  status  (Kurz 
et  al.  1996;  Mahy  et  al.  2002;  Scheuermann  et  al.  2004). 

Spatial  reorganization  of  the  genome  is  also  associated 
with  disease  (Borden  and  Manuelidis  1988;  Meaburn  et  al. 
2007;  Mehta  et  al.  2011;  Mewborn  et  al.  2010;  Mikelsaar 
et  al.  2014;  Paz  et  al.  2015),  including  cancer  (Cremer  et  al. 
2003;  Leshner  et  al.  2015;  Meabum  et  al.  2009;  Meabum 
and  Misteli  2008;  Murata  et  al.  2007;  Wiech  et  al.  2009; 
Zeitz  et  al.  2013).  Much  like  between  tissue  types,  changes 
in  genome  organization  are  not  global  and  only  subsets  of 
gene  loci  alter  their  nuclear  location  (Borden  and  Manue¬ 
lidis  1988;  Leshner  et  al.  2015;  Meaburn  et  al.  2007,  2009; 
Meabum  and  Misteli  2008;  Zeitz  et  al.  2013).  We  have 
previously  identified  two  sets  of  genes  that  reposition  in 
malignant  breast  or  prostate  tissue,  respectively,  compared 
to  their  normal  counterparts  (Leshner  et  al.  2015;  Mea¬ 
burn  et  al.  2009).  These  repositioning  events  are  specific 
to  the  malignant  state  and  do  not  commonly  occur  in  non- 
malignant  disease,  and  they  are  not  accounted  for  by  inter¬ 
individual  variations  nor  do  they  correlate  with  numerical 
genome  abnormalities,  gene  ontology,  the  local  gene  den¬ 
sity  surrounding  the  loci  or  changes  in  gene  expression 
(Leshner  et  al  2015;  Meaburn  et  al.  2009;  Meabum  and 
Misteli  2008).  In  line  with  partial  repositioning  of  genomes 
in  cancer,  the  majority  of  genes  do  not  change  radial  posi¬ 
tioning  in  neither  breast  nor  prostate  cancer,  suggesting 
gene-specific  repositioning  events  (Leshner  et  al.  2015; 
Meabum  et  al.  2009;  Meaburn  and  Misteli  2008). 

A  key  question  that  emerges  from  these  observations  is 
whether  the  same  genes  reposition  in  multiple  cancers  or 
whether  the  set  of  repositioned  genes  is  tissue  specific.  Pre¬ 
vious  analysis  has  found  repositioning  of  HSA  18  and  HSA 
19  in  many  types  of  cancer,  including  Hodgkin's  lymphoma, 
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melanoma,  colon,  cervical  and  thyroid  carcinomas,  suggest¬ 
ing  a  lack  of  tissue-of-origin  specificity  (Cremer  et  al.  2003; 
Murata  et  al.  2007;  Wiech  et  al.  2009).  However,  in  con¬ 
trast  to  the  other  types  of  cancers  studied,  HSA  18  is  more 
peripherally  located  in  a  cervical  squamous  carcinoma  tis¬ 
sue  (Wiech  et  al.  2009).  Moreover,  BCL2  is  relocated  to  a 
more  peripheral  nuclear  position  in  a  BCL2  expressing  cer¬ 
vical  squamous  carcinoma  (Wiech  et  al.  2009),  but  its  posi¬ 
tion  is  unaffected  in  breast  cancer  (Meaburn  et  al.  2009), 
prostate  cancer  (Leshner  et  al.  2015)  and  in  a  BCL2  negative 
cervical  squamous  carcinoma  (Wiech  et  al.  2009),  pointing 
to  tissue-specific  differences  in  repositioning  behavior. 

To  more  systematically  determine  whether  cancer- 
associated  gene  repositioning  is  gene-  or  tissue-of-origin 
specific,  we  have  here  compared  the  nuclear  positions  of 
a  set  of  eleven  genes,  which  we  have  previously  identi¬ 
fied  to  robustly  reposition  in  either  breast  or  prostate  can¬ 
cers  (Leshner  et  al.  2015;  Meaburn  et  al.  2009).  We  find 
that  the  repositioned  genes  are  largely  distinct  in  each  tis¬ 
sue  type  with  only  two  genes  repositioning  in  both  types  of 
cancer.  These  results  point  to  tissue-of-origin  specificity  for 
gene  repositioning  in  cancer. 

Materials  and  methods 
Tissue  FISH 

To  generate  fluorescence  in  situ  hybridization  (FISH)  probes, 
bacterial  artificial  chromosome  (BAC)  clones  (BACPAC 
resource  center)  (Suppl.  Table  SI)  were  label  with  either 
biotin-  or  digoxigenin-conjugated  dUTPs  (Roche)  by  nick 
translation,  as  previously  described  (Meaburn  2010;  Mea¬ 
burn  et  al.  2009).  FISH  was  performed  on  4-  to  5-pm-thick 
FFPE  de-identified  human  tissue  sections  (Suppl.  Table  S2), 
as  previously  described  in  Meaburn  et  al.  (2009)  and  using 
the  following  modifications:  the  60  °C  slide  baking  step  was 
not  performed,  tissue  sections  were  incubated  in  0.25  mg/ 
ml  proteinase  K  (Sigma- Aldrich)  for  15-20  min,  except 
for  single  tissue  slides  from  Biomax  Inc,  where  0.5  mg/ml 
proteinase  K  was  typically  required.  Tissue  sections  and  tis¬ 
sue  microarrays  (TMAs)  were  purchased  from  US  Biomax 
Inc,  Imgenex  Corporation,  Folio  Bioscience  and  Biochain 
Institute  or  were  acquired  from  the  University  of  Washing¬ 
ton  under  the  guidelines  and  approval  of  the  Institutional 
Review  Board  of  the  University  of  Washington  (#00-3449) 
(Suppl.  Table  S2).  The  prostate  tissues  from  the  University 
of  Washington  were  reviewed  by  a  genitourinary  pathologist 
(L.D.T.).  The  panel  of  tissues  included  twelve  breast  cancers, 
six  benign  breast  tissues  (hyperplasia  and  fibroadenoma),  six 
normal  breast  tissues,  20  prostate  cancers,  four  hyperplasic 
prostate  tissues  and  24  histologically  benign  (normal)  pros¬ 
tate  specimens  (Suppl.  Table  S2). 


Image  acquisition  and  FISH  analysis 

Image  accusation  was  performed  as  previously  described 
(Leshner  et  al.  2015;  Meaburn  et  al.  2009).  Briefly,  all 
imaging  was  performed  on  a  wide-field  1X70  (Olym¬ 
pus)  Deltavision  (Applied  Precision)  microscope  system, 
equipped  with  a  60  x  1.42  N  oil  objective  lens  (Olym¬ 
pus).  An  auxiliary  magnification  of  1.5  and  a  z-step  size  of 
0.5  pm  were  used  to  acquire  image  stacks.  Most  tissue  sec¬ 
tions  or  TMA  tissue  cores  contained  predominantly  a  sin¬ 
gle  morphology  (e.g.,  malignant  tissue  only),  and  regions 
of  epithelial  nuclei  were  randomly  imaged  over  the  slide 
or  tissue  core.  The  prostate  tissues  from  the  University  of 
Washington,  however,  often  contained  multiple  morpholo¬ 
gies  on  the  same  slide.  In  these  cases,  malignant  and  non- 
malignant  glands  were  imaged  and  analyzed  separately, 
after  examination  of  the  tissue  at  low  resolution  (lOx  lens; 
Olympus)  and  consultation  of  hematoxylin  and  eosin- 
stained  slides,  which  had  been  annotated  by  a  pathologist 
(L.D.T.).  Images  of  epithelial  nuclei  were  then  randomly 
acquired  within  the  predetermined  regions  of  benign  or 
malignant  tissue.  Before  image  analysis,  2D  maximum 
intensity  projections  were  generated  from  deconvolved 
image  stacks,  using  SoftWoRx  (Applied  Precision)  as  pre¬ 
viously  described  (Meaburn  et  al.  2009). 

Analysis  to  map  the  spatial  position  of  FISH  signals  was 
performed  as  previously  described  (Leshner  et  al.  2015; 
Meaburn  et  al.  2009).  Briefly,  89-169  interphase  epithe¬ 
lial  nuclei,  which  had  been  manually  segmented  in  Photo¬ 
shop  7.0  (Adobe)  based  on  DAPI  staining,  were  run  though 
custom  image  analysis  algorithms  (Meaburn  et  al.  2009), 
which  were  executed  in  MATLAB  (The  Mathsworks  Inc.) 
and  utilized  DIPImage  and  PRTools  toolboxes  (Deft  Uni¬ 
versity).  To  determine  the  radial  position  of  each  FISH 
signal,  the  distance  to  the  nearest  nuclear  boundary  was 
determined  for  each  pixel  in  a  cell  nucleus  using  Euclid¬ 
ean  distance  transform  (EDT)  and  the  geometric  gravity 
center  of  each  FISH  signal  was  determined  automatically. 
Next,  the  EDT  value  for  a  FISH  signal  was  normalized  to 
the  maximal  nuclear  EDT  for  the  corresponding  nucleus 
to  normalize  for  variations  in  nuclear  size  and  shape.  To 
generate  a  relative  radial  distribution  (RRD)  for  a  gene  in 
a  given  specimen,  the  normalized  FISH  signal  EDT  for 
every  allele  from  that  tissue  was  combined  and  a  cumula¬ 
tive  frequency  distribution  was  produced.  For  each  gene  a 
PND  was  also  created.  To  this  end,  the  normalized  EDTs 
from  all  allele  in  each  normal  tissue,  for  a  given  gene, 
were  combined.  Table  1  details  the  number  of  normal  tis¬ 
sues  analyzed  for  each  gene,  and  Suppl.  Table  SI  details 
the  number  of  nuclei  used  to  produce  each  PND.  Finally,  to 
statistically  compare  a  gene’s  positioning  patterns  between 
individuals  and  between  histological  states,  cumulative 
RRDs  between  tissues  were  cross-compared  using  the 
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nonparametric  two-sample  ID  Kolmogorov-Smirnov  (KS) 
test,  with  P  <  0.01  considered  significant. 

Some  previously  reported  data  were  included  in  the  cur¬ 
rent  analysis  for  comparison  of  the  positioning  pattern  of  a 
gene  between  normal  breast  and  prostate  tissues.  The  RRDs 
for  AKTI,  CSF1R,  ERBB2,  FOSL2,  HES5,  HSP90AA1, 
MYC,  TGFB3,  BCL2,  CCND1,  MMP1  and  VEGFA  in  nor¬ 
mal  breast  tissues  were  reported  in  Meaburn  et  al.  (2009) 
and  the  RRDs  for  FLI1,  MMP9,  MMP9 ,  BCL2,  CCND1, 
MMP1  and  VEGF  in  prostate  tissue  are  as  reported  in 
Leshner  et  al.  (2015). 

Results 

We  have  previously  identified  a  set  of  eight  genes  (AKTI, 
CSF1R,  ERBB2,  FOL2,  HES5,  HSP90AA1,  MYC  and 
TGFB3)  that  reposition  in  breast  cancer  (Meaburn  et  al. 
2009)  and  a  set  of  three  genes  (FLI1,  MMP2  and  MMP9 ) 
that  reposition  in  prostate  cancer  (Leshner  et  al.  2015).  We 
refer  to  these  gene  sets  as  breast  GBPs  (gene  positioning 
biomarkers)  and  prostate  GPBs,  respectively.  To  determine 
whether  the  observed  repositioning  events  between  nor¬ 
mal  and  cancerous  tissues  are  tissue  specific,  we  sought 
to  cross-compare  prostate  GPBs  and  breast  GPBs  in  their 
heterologous  tissues.  To  this  end,  we  performed  FISH  to 
visualize  the  GPBs  (Suppl.  Table  SI)  in  4-  to  5-pm-thick 
formalin-fixed,  paraffin-embedded  (FFPE)  human  tissues 
(Suppl.  Table  S2).  We  positioned  prostate  GPBs  in  benign 


Fig.  1  Tissue-specific  spatial  organization  of  the  genome,  a  FISH  ► 
was  used  to  detect  gene  loci  in  FFPE  tissue  sections.  FLI1  (green)  in 
a  normal  breast  tissue  and  HES5  (green)  in  a  normal  prostate  tissue. 
Blue,  DAPI  nuclear  counterstain.  Scale  bar  5  pm.  Projected  image 
stacks  are  shown,  b.  c  Heat  maps  representing  the  pairwise  statisti¬ 
cal  comparisons  of  the  positioning  patterns  of  the  indicated  genes 
between  normal  breast  tissues  (B-N1-B-N17;  see  Suppl.  Tables  S2, 
S3)  and  nonnal  prostate  tissues  (P-N1-P-N31;  see  Suppl.  Tables 
S2,  S3),  using  the  two-sample  ID  KS  test,  b  Genes  that  reposition 
in  either  breast  or  prostate  cancer  (or  both),  c  Genes  that  do  not  sig¬ 
nificantly  reposition  in  either  breast  or  prostate  cancer.  For  these 
comparisons,  the  RRDs  for  AKTI,  CSF1R,  ERBB2,  FOSL2,  HES5, 
HSP90AA1,  MYC,  TGFB3,  BCL2,  CCND1,  MMP1  and  VEGFA  in 
normal  breast  tissues  are  from  Meaburn  et  al.  (2009)  and  the  RRDs 
for  FLU,  MMP9,  MMP2,  BCL2,  CCND1.  MMP1  and  VEGF  in  pros¬ 
tate  tissue  are  from  Leshner  et  al.  (2015) 

(normal,  hyperplasia,  fibroadenoma)  and  malignant  breast 
tissues  and,  conversely,  we  positioned  breast  GPBs  in  nor¬ 
mal,  normal  adjacent  to  tumor  (NAT)  and  adenocarcinoma 
prostate  tissues  (Suppl.  Table  S2).  Typically  100-150  ran¬ 
domly  selected  epithelial  nuclei  were  analyzed  per  tissue 
for  each  gene  as  previously  described  (Leshner  et  al.  2015; 
Meaburn  et  al.  2009).  The  radial  position  of  each  allele, 
normalized  to  nuclear  size,  was  determined  from  projec¬ 
tions  of  image  stacks  using  EDT,  as  previously  described 
(see  Materials  and  Methods  for  details;  Leshner  et  al.  2015; 
Meaburn  et  al.  2009).  The  normalized  positions  for  each 
allele  in  a  tissue  were  then  combined  to  generate  a  cumu¬ 
lative  RRD  for  each  gene.  Statistical  differences  between 
samples  were  assessed  using  the  two-sample  1-D  KS  test 
as  described  (see  “Materials  and  methods”  ;  Leshner  et  al. 


Table  1  Comparison  of  gene 
positioning  patterns  between 
normal  breast  and  normal 
prostate  tissues 


Gene 

%  (and  number)  of  SD  cross  comparison  between  individual 
normal  breast  and  prostate  tissues 

Tissue-specific  positioning?2 

AKTI 

39.3  %  (ll/28)b 

No 

CSF1R 

64.9  %  (50/77)b 

Yes 

ERBB2 

30.6  %  (ll/36)b 

No 

FLI1 

20.8  %  (10/48)c 

No 

FOSL2 

46.0  %  (29/63 )b 

No 

HESS 

14.3  %  (6/42)b 

No 

HSP90AA1 

61.2  %  (26/42)b 

Yes 

MMP2 

28.1  %  (9/32)c 

No 

MMP9 

86.7  %  (52/60 )c 

Yes 

MYC 

97.2  %  (35/36)b 

Yes 

TGFB3 

25.0  %  (12/48)b 

No 

BCL2 

100.0%  (12/12)b,c 

Yes 

CCND1 

37.5  %  (6/16)b,c 

No 

MMP1 

66.7  %  (4/6)b,c 

Yes 

VEGF 

9.6  %  (5/52)b,c 

No 

SD  significantly  different,  based  on  a  two-sample  ID  KS  test,  P  <  0.01 

a  Between  normal  breast  and  prostate  tissue.  b'  c  These  comparisons  utilize  positioning  RRDs  (position 
distributions)  previously  generated,  specifically:  b  RRDs  from  breast  tissues  were  previously  published  in 
Meaburn  et  al.  (2009)  and  c  prostate  tissue  RRDs  are  from  Leshner  et  al.  (2015) 
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2015;  Meaburn  et  al.  2009),  and  P  <  0.01  was  considered 
significant. 

Tissue-specific  spatial  organization  in  normal  tissues 

To  first  determine  tissue  specificity  of  gene  positioning 
in  normal  tissues,  we  compared  the  positioning  patterns 
of  the  full  set  of  eleven  genes  between  multiple  normal 
breast  and  prostate  tissues  (Table  1,  Suppl.  Tables  S2-S3; 
Fig.  la,  b,  Suppl.  Fig.  Sla).  To  increase  the  number  of 
genes  compared  between  the  tissue  types,  for  this  analysis 
we  included  three  additional  genes  that  do  not  reposition 
significantly  in  breast  or  prostate  cancer  (BCL2,  CCND1 
and  MMP1 )  and  VEGF,  which  repositioned  in  approxi¬ 
mately  half  of  both  prostate  and  breast  cancers  (Table  1, 
Suppl.  Table  S4;  Fig.  lc,  Suppl.  Fig.  Sib).  Many  genes 


Fig.  2  Positioning  patterns  in  cancer  compared  to  normal  tissue,  a  ► 
MMP9  (green)  was  detected  by  FISH  in  normal  and  cancerous  breast 
FFPE  tissue  sections.  Blue ,  DAPI  nuclear  counterstain.  Scale  bar , 

5  pm.  Projected  image  stacks  are  shown,  b  Cumulative  RRDs  for  the 
indicated  genes  in  cancer  (red),  normal  tissues  (black)  and  the  pooled 
normal  distribution  (blue).  The  tissue  type  is  specified  in  each  graph. 
RRP  relative  radial  position 

exhibited  significantly  overlapping  distributions  between 
normal  breast  and  prostate  tissues.  For  six  genes  (40  %; 
VEGFA,  HES5,  FLU,  TGFB3,  MMP2  and  ERBB2)  less 
than  a  third  of  the  pairwise  cross  comparisons  of  RRDs 
between  normal  breast  and  prostate  tissue  were  signifi¬ 
cantly  different  and  for  an  additional  three  genes  (20  %; 
CCND1,  AKT1  and  FOSL2 )  37.5-46.0  %  of  the  cross  com¬ 
parisons  between  the  tissue  types  were  significant  (Table  1; 
Fig.  1,  Suppl.  Fig.  SI).  On  the  other  hand,  six  of  the  15 


Table  2  Cross  comparisons  Gene  Tissue  Number  of  tissues  %  (and  number)  of  SD  cross  comparison  between 

between  individual  tissues  _  _ 


Normal 

Cancer 

Individual  normal  tissues 

Individual  normal  and  cancer  tissues 

flii 

Breast 

6 

10 

33.3  %  (5/15) 

66.7  %  (40/60) 

MMP9 

Breast 

6 

12 

6.7  %  (1/15) 

66.7  %  (48/72) 

MMP2 

Breast 

4 

6 

50.0  %  (3/6) 

37.5  %  (9/24) 

AKT1 

Prostate 

4 

6 

16.7  %  (1/6) 

37.5  %  (9/24) 

CSF1R 

Prostate 

11 

9 

41.8  %  (23/55) 

38.4%  (38/99) 

ERBB2 

Prostate 

4 

5 

33.3  %  (2/6) 

35.0  %  (7/20) 

FOSL2 

Prostate 

9 

7 

38.9%  (14/36) 

47.6  %  (30/63) 

HES5 

Prostate 

6 

6 

20.0  %  (3/15) 

27.8  %  (10/36) 

HSP90AA1 

Prostate 

7 

4 

57.1  %  (12/21) 

46.4%  (13/28) 

MYC 

Prostate 

6 

5 

6.7  %  (1/15) 

3.3  %  (1/30) 

TGFB3 

Prostate 

6 

4 

33.3  %  (5/15) 

25.0  %  (6/24) 

SD  significantly  different,  based  on  a  two-sample  ID  KS  test,  P  <  0.01 


Table  3  Comparison  of  individual  tissues  to  a  pooled  normal  distribution 


Gene 

Tissue 

%  (and  number)  of  SD  cross  comparison  between 

Individual  normal  tissues  and  pooled  normal 

Individual  cancer  tissues  and  pooled  normal 

Fill 

Breast 

16.7  %  (1/6) 

100.0%  (10/10) 

MMP9 

Breast 

0.0  %  (0/6) 

83.3  %  (10/12) 

MMP2 

Breast 

25.0  %  (1/4) 

0.0  %  (0/6) 

AKT1 

Prostate 

0.0  %  (0/4) 

50.0  %  (3/6) 

CSF1R 

Prostate 

27.3  %  (3/11) 

44.4  %  (4/9) 

ERBB2 

Prostate 

0.0  %  (0/4) 

40.0  %  (2/5) 

FOSL2 

Prostate 

33.3  %  (3/9) 

42.9  %  (3/7) 

HES5 

Prostate 

0.0  %  (0/6) 

33.3  %  (2/6) 

HSP90AA1 

Prostate 

42.9  %  (3/7) 

50.0  %  (2/4) 

MYC 

Prostate 

0.0  %  (0/6) 

0.0  %  (0/5) 

TGFB3 

Prostate 

16.7  %  (1/6) 

25.0  %  (1/4) 

SD  significantly  different,  based  on  a  two-sample  ID  KS  test,  P  <  0.01 
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◄Fig.  3  Statistical  cross  comparisons  of  gene  positioning  in  cancer 
tissues.  Heat  maps  representing  the  pairwise  statistical  comparisons 
of  positioning  patterns  of  indicated  genes  between  tissues,  using 
the  two-sample  ID  KS  test.  B-N1-B-N6,  normal  breast  tissues  (see 
Suppl.  Table  S2);  B-C1-B-C12,  cancerous  breast  tissues;  P-Nl-P- 
N24,  normal  prostate  tissues;  P-C1-P-C20,  cancerous  prostate  tis¬ 
sues;  PND  pooled  normal  distribution.  Black  or  white  asterisks  indi¬ 
cate  a  cross  comparison  between  a  normal  and  cancer  specimen  from 
the  same  individual 

genes  (40  %;  HSP90AA1,  CSF1R,  MMP1,  MMP9,  MYC, 
and  BCL2 )  displayed  distinct  tissue-specific  positioning, 
with  61.2-100  %  of  the  pairwise  comparisons  between  nor¬ 
mal  breast  and  prostate  tissue  being  significantly  different 
(Table  1;  Fig.  1,  Suppl.  Fig.  SI).  These  observations  point 
to  a  partial  conservation  of  gene  positioning  between  breast 
and  prostate  tissues. 

Distinct  subsets  of  genes  reposition  in  breast 
and  prostate  cancer 

To  address  if  similar  repositioning  events  occur  in  multiple 
cancer  types  or  if  distinct  sets  of  genes  reposition  in  can¬ 
cers  originating  from  different  tissues,  we  compared  the 
repositioning  behavior  of  breast  GPBs  in  prostate  cancer 
and  vice  versa.  To  this  end,  we  first  compared  the  RRDs  of 
the  prostate  GPBs,  FLIP  MMP9  and  MMP2,  in  malignant 
breast  tissues  to  individual  normal  breast  tissues  and  to  a 
pooled  normal  distribution  (PND;  Tables  2,  3;  Figs.  2,  3), 
which  was  generated  by  averaging  the  RRDs  from  all  nor¬ 
mal  tissues,  of  a  given  tissue-type,  analyzed  into  a  single 
distribution  for  each  gene  as  previously  described  (Leshner 
et  al.  2015;  Meaburn  et  al.  2009;  see  Materials  and  meth¬ 
ods).  We  find  that  MMP2  was  similarly  positioned  in  breast 
cancer  and  normal  breast  tissue  with  37.5  %  (9/24)  of  indi¬ 
vidual  cross  comparisons  between  normal  and  malignant 
breast  tissues  significantly  different.  In  addition,  the  distri¬ 
bution  of  MMP2  was  not  statistically  significantly  differ¬ 
ent  from  the  PND  in  all  six  breast  cancer  tissues  (Tables  2, 
3;  Figs.  2b,  3).  Furthermore,  the  position  of  MMP2  was 
more  different  among  normal  tissues  than  when  normal 
and  cancer  tissue  was  compared  (Tables  2,  3;  Figs.  2b,  3). 
In  contrast  to  MMP2,  the  prostate  GPBs  FLI1  and  MMP9 
also  repositioned  in  breast  cancer  (Tables  2,  3;  Figs.  2,  3). 
For  both  genes,  66.7  %  (40/60  and  48/72,  respectively)  of 
cross  comparisons  between  normal  breast  and  breast  can¬ 
cer  were  significantly  different.  Moreover,  compared  to 
the  PND,  FLI1  was  repositioned  in  all  ten  breast  cancers 
and  MMP9  was  repositioned  in  83.3  %  (10/12)  of  breast 
cancers.  Importantly,  FLI1  and  MMP9  were  both  simi¬ 
larly  positioned  among  normal  breast  tissues,  with  only 
33.3  %  (5/15)  and  6.7  %  (1/15),  respectively,  of  individual 
cross  comparisons  between  normal  breast  tissues  reach¬ 
ing  significance.  Finally,  compared  to  their  PNDs,  FLU 
was  repositioned  in  a  single  (1/6)  normal  tissue  and  MMP9 


was  repositioned  in  none  (0/6)  of  the  normal  breast  tissues 
(Tables  2,  3;  Figs.  2b,  3). 

In  an  analogous  fashion,  we  next  determined  how  many 
of  the  previously  identified  breast  GPBs  ( AKT1 ,  CSF1R, 
ERBB2,  FOL2,  HES5 ,  HSP90AA1,  MYC  and  TGFB3) 
reposition  in  prostate  cancer.  We  find  that  none  of  these 
breast  GPBs  robustly  reposition  in  prostate  cancer  tissue 
(Tables  2,  3;  Figs.  2b,  3).  The  majority  of  cross  compari¬ 
sons  between  normal  and  prostate  cancer  tissues  were  not 
significantly  different  for  any  of  the  genes  with  between 
3.3-47.6  %  cross  comparisons  being  significantly  differ¬ 
ent.  For  most  genes,  the  number  of  significantly  different 
cross  comparisons  between  normal  and  malignant  pros¬ 
tate  tissues  was  similar  or  less  than  the  proportion  (16.7— 
57.1  %)  of  cross  comparisons  between  normal  tissues  that 
were  significantly  different,  suggesting  that  the  reposition¬ 
ing  was  not  cancer  specific  but  rather  related  to  the  vari¬ 
ability  in  positioning  patterns  between  individuals  (Table  2; 
Figs.  2b,  3).  Comparing  the  position  of  genes  in  prostate 
cancer  specimens  to  their  PND  also  identified  only  a  lim¬ 
ited  amount  of  repositioning  of  these  genes  in  prostate  can¬ 
cer  (Table  3;  Figs.  2b,  3).  MYC,  TGFB3  and  HES5  reposi¬ 
tioned  in  0-33.3  %  prostate  adenocarcinomas  compared  to 
their  PNDs  and  ERBB2,  FOL2,  CSF1R,  AKT1  and  TGFB3 
in  40.0-50.0  %  of  cancer  tissues  compared  to  their  PND 
(Table  3;  Fig.  3). 

The  radial  positions  of  genes  in  breast  tissues  are 
highly  conserved  between  individuals  (Figs.  2b,  3;  Mea¬ 
burn  et  al.  2009).  In  contrast,  we  have  previously  iden¬ 
tified  several  genes  that  occupy  more  variable  positions 
between  individuals  in  prostate  tissue  (Leshner  et  al. 
2015).  It  is  unclear  whether  this  is  a  feature  of  the  indi¬ 
vidual  loci  or  reflects  differences  in  the  degree  of  overall 
positional  conservation  between  breast  and  prostate  tis¬ 
sue.  In  keeping  with  their  positioning  patterns  in  breast 
tissue,  for  five  of  the  breast  GPBs  there  was  little  varia¬ 
bility  in  the  positioning  pattern  of  normal  prostate  tissues 
between  individuals,  with  16.7-33.3  %  of  individual  cross 
comparisons  among  normal  prostate  tissues  being  signifi¬ 
cantly  different  (Table  2;  Figs.  2b,  3).  Moreover,  for  four 
of  these  genes,  none  of  the  normal  tissues  were  signifi¬ 
cantly  different  to  the  PND,  and  for  TGFB3  only  a  single 
normal  tissue  (16.7  %;  1/6  tissues)  was  significantly  dif¬ 
ferent  to  the  PND  (Table  3;  Figs.  2b,  3).  Conversely,  in 
contrast  to  their  positioning  profiles  in  breast  tissue,  the 
positions  of  three  genes  ( FOL2 ,  CSF1R  and  HSP90AA1) 
were  more  variable  between  individuals,  with  38.9- 
57.4  %  of  individual  cross  comparisons  reaching  signifi¬ 
cance,  and  27.3-42.9  %  of  normal  tissues  significantly 
different  to  the  PND  (Tables  2,  3;  Figs.  2b,  3).  These  data 
suggest  that  differences  in  the  extent  of  inter-individual 
variations  of  gene  positioning  patterns  are  tissue  specific 
rather  than  gene  specific. 


5)  Springer 


/I 


thor's  personal  copy 


442 


Histochem  Cell  Biol  (2016)  145:433-446 


Table  4  Comparison  of  gene  positioning  in  benign  tissues 


Gene 

Tissue 

%  (and  number)  of  SD  cross  comparison  between 

Individual  benign  tissues 

Individual  normal  and  benign  tissues 

Individual  benign  tissues  and  pooled  normal 

FLI1 

Breast 

10.0%  (1/10) 

26.7  %  (8/30) 

40.0  %  (2/5) 

MMP9 

Breast 

10.0  %  (1/10) 

20.0  %  (6/30) 

20.0  %  (1/5) 

MMP2 

Breast 

0.0  %  (0/10) 

45.0  %  (9/20) 

40.0  %  (2/5) 

CSF1R 

Prostate 

0.0  %  (0/6) 

18.2  %  (8/44) 

0.0  %  (0/4) 

SD  significantly  different,  based  on  a  two-sample  ID  KS  test,  P  <  0.01 
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Fig.  4  Gene  positioning  in  non-malignant  disease  resembles  normal 
tissue.  Positions  of  indicated  genes  were  compared  between  benign 
breast  disease  (hyperplasia:  B-B1-B-B3;  fibroadenoma:  B-B4-B-B6; 
see  Suppl.  Table  S2)  and  normal  breast  tissue  (B-N1-B-N6).  Left- 
hand  panel  cumulative  RRDs  for  the  indicated  genes  in  benign  (red). 


normal  tissues  (black)  and  the  pooled  normal  distribution  (PND; 
blue).  Right-hand  panel  pairwise  statistical  comparisons  of  RRDs 
between  benign  and  normal  tissues  and  among  benign  tissues,  using 
the  two-sample  ID  KS  test.  RRP  relative  radial  position 


We  conclude  that  only  two  of  the  eleven  marker  genes 
(18.2  %;  MMP9  and  FLI1;  Suppl.  Table  S4)  reposition  in 
both  breast  and  prostate  cancer,  demonstrating  that  cancer- 
related  repositioning  is  not  a  gene-intrinsic  feature  but  that 
distinct  sets  of  genes  reposition  in  cancer  in  a  tissue-of-ori- 
gin-specihc  manner. 

The  spatial  repositioning  of  FLU  and  MMP9  is  specific 
to  cancer 

In  addition  to  cancer,  we  have  shown  that  genes  can  reposi¬ 
tion  in  breast  and  prostate  tissue  with  abnormal  but  non- 
malignant  histology  (Leshner  et  al.  2015;  Meaburn  et  al. 
2009).  Therefore,  to  ascertain  if  the  repositioning  of  the 


prostate  GPBs  markers  FLI1  and  MMP9  in  breast  cancer 
tissues  was  specific  to  cancer  or  also  occurred  in  abnor¬ 
mal,  but  non-malignant,  breast  tissue,  we  positioned  these 
genes  in  benign  hyperplasia  and  fibroadenoma  breast  tis¬ 
sues  (Table  4,  Suppl.  Table  S2;  Fig.  4).  The  positions  of 
both  FLU  and  MMP9  were  similar  in  the  non-malignant 
diseased  tissues  with  only  a  single  cross  comparison  reach¬ 
ing  significance  for  each  gene  (1/10;  Table  4;  Fig.  4).  There 
was  also  only  limited  repositioning  of  FLI1  and  MMP9 
when  the  non-malignant  diseased  tissues  were  compared 
to  normal  breast  tissues,  with  20-26.7  %  of  cross  compari¬ 
sons  to  individual  normal  tissues  and  20-40  %  (1/5  and  2/5, 
respectively)  of  comparisons  to  the  PND  being  significant 
(Table  4;  Fig.  4).  These  data  suggest  that  FLU  and  MMP9 
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Table  5  False-positive  and  false-negative  rates 


Gene 

Tissue 

Normal  tissue  false  positives 

Benign  tissue  false  positives 

Total  false  positives 

False-negative  rate 

FLI1 

Breast 

16.7  %  (1/6) 

40.0  %  (2/5) 

27.3  %  (3/11) 

0.0  %  (0/10) 

MMP9 

Breast 

0.0  %  (0/6) 

20.0  %  (1/5) 

9.1  %  (1/11) 

16.7  %  (2/12) 

MMP2 

Breast 

25.0%  (1/4) 

40.0  %  (2/5) 

33.3  %  (3/9) 

100.0  %  (6/6) 

AKT1 

Prostate 

0.0  %  (0/4) 

ND 

0.0  %  (0/4) 

50.0  %  (3/6) 

CSF1R 

Prostate 

27.3%  (3/11) 

0.0  %  (0/4) 

20.0  %  (3/15) 

55.6  %  (5/9) 

ERBB2 

Prostate 

0.0  %  (0/4) 

ND 

0.0  %  (0/4) 

60.0  %  (3/5) 

FOSL2 

Prostate 

33.3  %  (3/9) 

ND 

33.3  %  (3/9) 

57.1  %  (4/7) 

HES5 

Prostate 

0.0  %  (0/6) 

ND 

0.0  %  (0/6) 

66.7  %  (4/6) 

HSP90AA1 

Prostate 

42.9  %  (3/7) 

ND 

42.9  %  (3/7) 

50.0  %  (2/4) 

MYC 

Prostate 

0.0  %  (0/6) 

ND 

0.0  %  (0/6) 

100.0  %  (5/5) 

TGFB3 

Prostate 

16.7  %  (1/6) 

ND 

16.7  %  (1/6) 

75.0  %  (3/4) 

The  percentage  (and  number)  of  tissues  that  give  a  false-positive  or  false-negative  result.  A  false  positive  is  scored  when  a  gene  has  a  statistically 
significantly  different  RRD  in  a  normal,  fibroadenoma  or  hyperplasic  tissue  compared  to  the  pooled  normal  (P  <  0.01;  KS  test).  A  false  negative 
is  scored  when  a  gene  has  a  similar  RRD  in  a  cancer  tissue  to  that  of  the  pooled  normal  distribution  (KS  test,  P  >  0.01) 

ND  not  determined 


repositioning  is  specific  to  cancer  and  not  a  general  feature 
of  a  diseased  phenotype. 

Since  MMP2  repositioned  in  hyperplasic  prostate  tissue 
(Leshner  et  al.  2015),  we  also  positioned  it  in  benign  breast 
disease.  MMP2  was  positioned  identically  among  all  the 
non-malignant  diseased  tissues  (Table  4;  Suppl.  Fig.  S2) 
and  showed  a  low  level  of  repositioning  in  the  non-malig- 
nant  diseased  breast  tissues  compared  to  normal  tissue  with 
45.0  %  (9/20)  of  the  cross  comparisons  to  individual  nor¬ 
mal  tissues  reaching  significance  and  only  two  (40  %)  of 
non-malignant  diseased  tissues  significantly  different  to  the 
PND  tissues  (Table  4;  Suppl.  Fig.  S2),  further  demonstrat¬ 
ing  the  tissue-specific  nature  of  the  MMP2  repositioning 
event. 

Normal  and  NAT  prostate  tissue  often  contain  small 
areas  of  mild  hyperplasic  morphology.  Therefore,  as  a  con¬ 
trol  to  ensure  the  heterogeneity  in  positioning  patterns  for 
some  genes  in  normal  prostate  tissues  was  not  a  reflection 
of  the  inclusion  of  abnormal  tissue,  we  positioned  CSF1R, 
which  displayed  a  high  level  of  variability  between  individ¬ 
uals  in  hyperplasic  prostate  tissues.  CSF1R  was  positioned 
identically  among  all  hyperplasic  prostate  tissues  (Table  4; 
Suppl.  Fig.  S2).  Similarly,  the  position  of  CSF1R  was  also 
similar  between  hyperplasic  and  normal  tissues  with  all 
four  hyperplasic  prostate  tissues  distributions  statistically 
similar  to  the  PND  and  only  18.2  %  (8/44)  of  cross  com¬ 
parisons  between  hyperplasic  and  individual  normal  tis¬ 
sues  being  significantly  different  (Table  4;  Suppl.  Fig.  S2). 
This  suggests  that  the  presence  of  hyperplasic  nuclei  within 
a  normal  tissue  RRD  is  not  skewing  spatial  positioning 
patterns. 

Finally,  the  high  rate  of  repositioning  of  FLI1  and 
MMP9  in  breast  cancer  tissue  combined  with  the  low  level 


of  repositioning  in  benign  tissues  make  these  two  genes 
potential  breast  GPBs  for  diagnostic  applications.  To  assess 
their  suitability,  we  determined  false-positive  and  false¬ 
negative  rates  for  these  two  genes  in  breast  cancer  tissues 
(Table  5).  A  false  positive  is  defined  as  spatial  reposition¬ 
ing  of  a  gene  in  a  non-malignant  tissue  compared  to  the 
PND,  which  would  incorrectly  classify  the  tissue  as  can¬ 
cer.  Conversely,  a  false  negative  is  defined  as  a  cancer  tis¬ 
sue  where  the  given  gene  did  not  reposition,  which  would 
incorrectly  classify  the  tissue  as  non-malignant.  Both  FLI1 
and  MMP9  had  low  false-negative  rates  in  breast  cancer  of 
0  %  (0/10)  and  16.7  %  (2/12),  respectively  (Table  5),  and 
had  false  positive  rates  of  27.3  %  (3/11)  and  9.1  %  (1/11), 
respectively  (Table  5).  These  results  suggest  that  FLI1  and 
MMP9,  in  addition  to  being  prostate  GPBs,  are  also  poten¬ 
tial  candidates  as  breast  cancer  biomarkers. 


Discussion 

We  report  here  a  systematic  comparison  of  the  spatial  posi¬ 
tioning  patterns  of  a  diverse  set  of  gene  loci  in  both  normal 
and  malignant  breast  and  prostate  tissues.  For  many  genes, 
we  find  conservation  of  positioning  patterns  between  non- 
malignant  breast  and  prostate  tissues.  We  identify  only  two 
genes  that  are  repositioned  in  both  breast  and  prostate  can¬ 
cer,  compared  to  their  normal  counterparts.  These  observa¬ 
tions  demonstrate  that  cancer-related  gene  repositioning 
events  are  tissue-of-origin  specific. 

Consistent  with  previous  studies  in  other  tissues  and 
cell  types  (Battulin  et  al.  2015;  Boyle  et  al.  2001;  Cremer 
et  al.  2003;  Foster  et  al.  2012;  Mayer  et  al.  2005;  Parada 
et  al.  2004;  Peric-Hupkes  et  al.  2010),  we  find  considerable 
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conservation  of  spatial  positioning  patterns  between  nor¬ 
mal  breast  and  prostate  tissues,  with  60  %  of  the  15  genes 
analyzed  in  similar  radial  positions  between  the  two  organs. 
The  observed  differential  positioning  patterns  of  some 
genes  between  breast  and  prostate  are  unlikely  to  reflect 
global  genome  reorganization  or  repositioning  of  whole 
chromosomes,  but  appear  to  reflect  changes  at  the  level  of 
individual  genes.  In  support,  while  the  position  of  MMP1 
was  dependent  on  tissue  type,  two  other  genes  on  HSA  11, 
FLll  and  CCND1,  are  similarly  positioned  between  breast 
and  prostate  tissue.  Similarly,  of  the  three  genes  on  HSA  14, 
only  HSP90AA1  differs  between  breast  and  prostate  tissue. 

Our  study  extends  growing  evidence,  from  multiple 
tissue  types  including  brain,  thyroid,  breast  and  pros¬ 
tate,  that  the  spatial  positioning  of  the  genome  is  largely 
conserved  between  individuals  (Borden  and  Manuelidis 
1988;  Leshner  et  al.  2015;  Meaburn  et  al.  2009;  Murata 
et  al.  2007;  Timme  et  al.  2011;  Wiech  et  al.  2005).  We 
find  that  18  genes  analyzed  in  multiple  morphologically 
normal  breast  tissues  all  displayed  a  high  level  of  con¬ 
servation  of  positioning  between  individuals  (this  study 
and  Meaburn  et  al.  2009).  In  addition,  in  prostate  tissues 
44/48  (91.7  %)  of  genes  have  similar  positioning  patterns 
among  individuals  (this  study  and  Leshner  et  al.  2015). 
However,  some  inter-individual  variability  was  found, 
particularly  in  prostate.  Unlike  in  breast  (Meaburn  et  al. 
2009),  we  find  that  radial  distributions  of  FOSL2,  CSF1R 
and  HSP90AA1  are  variable  between  individual  prostate 
tissues.  Similarly,  the  position  of  CCND1  was  similar 
between  breast  tissues  (Meaburn  et  al.  2009),  yet  variable 
among  prostate  tissues  from  multiple  individuals  (Leshner 
et  al.  2015).  This  observation  leads  to  the  possibility  that 
some  tissues  have  a  higher  degree  of  intrinsic  variability 
in  spatial  genome  organization  patterns.  Due  to  the  high 
prevalence  of  benign  hyperplasia  of  the  prostate,  another 
possible  explanation  is  that  the  variability  in  positioning 
patterns  between  prostates  reflects  a  varying  amount  of 
hyperplasic  nuclei  included  in  the  analysis  of  “normal” 
tissue,  although  our  observation  of  similar  positioning  of 
CSF1R  in  hyperplasic  and  normal  argues  against  this  sce¬ 
nario.  It  is  also  possible  that  the  extent  of  other  benign 
morphologies,  such  as  proliferative  inflammatory  atrophy 
or  prostatic  intraepithelial  neoplasia,  may  affect  position¬ 
ing  patterns  between  individuals. 

We  have  previously  identified  eight  genes  that  reposi¬ 
tion  in  breast  cancer  tissues  (Meaburn  et  al.  2009).  By 
testing  prostate  GPBs  in  breast  tissue  as  part  of  our  cross 
comparison,  we  have  identified  two  additional  genes  ( FLU 
and  MMP9)  that  reposition  in  breast  cancer,  but  not  benign 
breast  tissues.  When  we  combine  our  current  findings  with 
our  previous  studies  (Leshner  et  al.  2015;  Meaburn  et  al. 


2009),  we  find  that  6.4  %  (3/47)  of  analyzed  genes  reposi¬ 
tion  in  prostate  cancer  and  43.5  %  (10/23)  of  tested  genes 
reposition  in  breast  cancer.  It  is  unclear  why  the  propor¬ 
tion  of  repositioning  genes  is  considerably  lower  in  pros¬ 
tate  cancer,  although  the  higher  inter-individual  variability 
in  prostate  may  be  a  contributing  factor.  Future  studies 
employing  high-throughput  imaging  to  map  a  larger  num¬ 
ber  of  genes  should  determine  if  this  difference  is  due  to 
the  limited  number  of  tested  genes  or  reflects  underlying 
biological  properties  or  both.  The  molecular  properties  that 
determine  whether  a  gene  repositions  between  normal  and 
cancer  tissues  are  currently  unknown.  Analysis  of  reposi¬ 
tioned  genes  in  breast  and  prostate  cancer  has  demonstrated 
that  cancer-related  repositioning  does  not  correlate  with 
their  gene  expression  status,  gene  copy  number  or  local 
gene  density  (Leshner  et  al.  2015;  Meaburn  et  al.  2009; 
Meaburn  and  Misteli  2008). 

We  find  that  a  particular  gene  can  behave  differently  in 
cancers  from  different  tissues.  Of  the  eleven  genes  previ¬ 
ously  identified  to  reposition  in  either  breast  (Meaburn 
et  al.  2009)  or  prostate  cancer  (Leshner  et  al.  2015),  only 
two  ( FLU  and  MMP9)  spatially  reposition  in  both  breast 
and  prostate  cancer.  In  keeping  with  our  finding,  BCL2  has 
previously  been  identified  to  reposition  in  some  cervical 
squamous  carcinomas  (Wiech  et  al.  2009),  but  not  breast 
(Meaburn  et  al.  2009)  or  prostate  (Leshner  et  al.  2015)  can¬ 
cer.  This  suggests  that  distinct  groups  of  genes  reposition 
in  cancers  from  different  tissues.  While  breast  and  prostate 
cancer  share  biological  features  such  as  their  origin  from 
epithelial  cells  and  dependence  on  sex  steroid  hormones 
(Risbridger  et  al.  2010),  it  will  be  interesting  to  determine 
in  future  studies  how  general  the  repositioning  of  FLll  and 
MMP9  is  in  other  cancers. 

The  fact  that  different  genes  spatially  reposition  in  dif¬ 
ferent  cancer  types  may  have  implications  for  diagnostics 
as  it  suggests  a  correlation  between  non-random  reposi¬ 
tioning  of  the  genome  during  carcinogenesis  with  the  spe¬ 
cific  biology  of  the  cancer.  This  leads  to  the  possibility  that 
differential  positioning  patterns  for  specific  genes  may 
occur  even  within  cancers  originating  from  the  same  tis¬ 
sue,  including  among  subtypes  of  tumors.  Identification  of 
subtype-specific  positioning  biomarkers  may  be  valuable  in 
cancer  prognostics,  for  example  in  distinguishing  indolent 
from  aggressive  cancers. 
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x.l  Abstract 

The  nucleus  is  a  complex  organelle  that  performs  a  wide  array  of  critical  functions. 
Within  the  nucleus  the  genome  is  highly  organized.  Individual  chromosomes  form 
discrete  chromosome  territories.  The  organization  of  the  genome  is  correlated  with 
function,  for  example  gene  expression.  Each  chromosome  and  gene  has  a  preferential 
spatial  location,  which  can  vary  by  cell  type,  differentiation  stage  and  during  disease. 
Active  and  inactive  chromatin  tends  to  be  spatially  separated,  both  within  the  3D  nuclear 
space  and  within  a  chromosome  territory.  The  molecular  mechanisms  that  determine 
genome  organization  are  currently  poorly  understood.  However,  it  is  known  that  the 
proximity  of  gene  loci  can  contribute  to  translocation  partner  choice.  The  recent 
development  of  a  plethora  of  new  molecular  techniques  and  imaging  strategies, 
combined  with  fluorescent  in  situ  hybridization,  is  being  applied  to  both  normal  and 
diseased  cells.  Such  studies  will  bring  us  closer  to  understanding  the  implications  of 
genome  organization  and  the  molecules  and  mechanisms  that  determine  it. 

x.2  Introduction 

The  basic  principles  underlying  how  chromosomes  behave  and  organize  themselves 
during  mitosis,  compacting  into  tightly  condensed  X-shaped  structures,  aligning  and 
then  separating  into  daughter  nuclei,  have  been  appreciated  since  the  late  1800's.  The 
interphase  nucleus,  however,  has  historically  been  far  more  of  a  black  box  with  regards 
to  its  compartmentalization  and  organization.  Yet  cells,  and  consequently 
chromosomes,  spend  most  of  their  time  in  interphase.  It  is  during  this  time  that  they 
perform  the  majority  of  their  functions. 
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Ground-breaking  work  over  the  last  several  decades  has  revealed  that  the  nucleus  is 
highly  compartmentalized  and  that  chromosomes  persist  as  distinct  nuclear 
subdomains,  known  as  chromosome  territories  (CTs)  (fig.  x.l).  These  territories  persist 
throughout  the  cell  cycle,  and  remain  largely  separate  from  each  other  occupying 
reproducible  positions  within  interphase  nuclei  (Cremerand  Cremer  2001;  Meaburn  and 
Misteli  2007).  The  first  evidence  that  chromosomes  occupy  discrete  territories  came  in 
the  1970's  and  early  80's  from  a  series  of  UV-laser  microbeam  experiments  (Cremer 
and  Cremer  2001;  Cremer  et  al.  2014).  A  microlaser  was  used  to  create  DNA  damage 
to  a  small  volume  of  the  interphase  nucleus.  The  sites  of  damage  detected  in  the 
subsequent  metaphase  were  limited  to  only  a  few  chromosomes  per  cell,  indicating  that 
chromosomes  occupy  discrete  domains  in  the  nucleus  since  damage  to  many  more 
chromosomes  would  have  been  expected  if  the  DNA  from  each  chromosome  was 
dispersed  throughout  the  nucleus.  The  first  direct  visualization  of  CTs  came  in  1985, 
when  total  human  genomic  DNA  was  used  as  a  probe  to  reveal  the  location  of  human 
DNA  in  interphase  nuclei  of  humamrodent  hybrid  cells,  which  contained  either  one  or  a 
few  human  chromosomes  (Cremer  et  al.  2014).  However,  it  was  not  until  the 
development  and  expanded  use  of  DNA  fluorescent  in  situ  hybridization  (FISH)  in 
interphase  cells  in  the  1980's  (Cremer  and  Cremer  2001;  Cremer  et  al.  2014)  that  it 
became  possible  to  truly  elucidate  interphase  chromosome  topography,  and  to  begin  to 
probe  the  relationship  between  the  spatial  organization  of  the  genome  and  nuclear 
function.  In  FISH  whole  chromosomes  or  individual  genomic  regions  are  "painted”  with 
fluorescent  labels  enabling  the  visualization  of  specific  sub-regions  of  the  genome.  More 
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recently,  a  host  of  complementary  biochemical  techniques  have  been  devised  to  probe 
genome  organization,  such  as  chromatin  conformation  capture  (3C)-based  technologies 
and  DNA  adenine  methyltransferase  identification  (DamID).  These  techniques  are  now 
confirming  conclusions  drawn  from  FISH  experiments  on  a  genome-wide  scale  and  are 
yielding  further  insights  into  spatial  genome  organization  (Bickmore  and  van  Steensel 
2013;  van  Steensel  and  Dekker  2010).  The  group  of  3C-technologies  measure  the 
contact  frequency  of  pairs  of  genomic  loci  in  fixed  nuclei,  to  determine  regions  that  are 
in  close  3D  spatial  proximity  within  a  population  of  cells  (reviewed  in  (van  Steensel  and 
Dekker  2010))  The  different  C-techniques  are  variations  on  the  same  basic  principle, 
differing  mainly  by  how  much  of  the  genome  they  cover.  3C  detects  the  contact 
frequencies  of  a  known  locus  to  selected  target  loci  of  interest,  4C  takes  this  one  step 
further  and  asks  where  a  single  "bait”  locus  contacts  other  chromatin  regions  across  the 
whole  genome.  In  a  more  unbiased  approach,  Hi-C  maps  all  contacts  in  the  genome. 
The  resolution  of  these  methods  is  limited  by  the  sequencing  depth.  For  example,  5C  is 
a  similar  "all  against  all"  approach  as  Hi-C,  but  uses  a  small,  defined  region  as  a  target, 
thus  increasing  resolution  (Bickmore  and  van  Steensel  2013;  van  Steensel  and  Dekker 
2010).  DamID  is  a  complementary  tool  for  detecting  genomic  interaction  (proximity)  with 
nuclear  sub-compartments.  To  this  end,  DNA  adenine  methyltransferase  is  fused  to  a 
protein  of  interest,  such  as  the  nuclear  lamina  protein  lamin  B.  This  fusion  protein 
methylates  adenines  in  DNA  that  is  in  proximity  with  the  fusion  protein,  thus  mapping 
associated  genomic  regions  (reviewed  in  (van  Steensel  and  Dekker  2010)). 
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While  there  is  no  question  that  the  genome  is  highly  organized  within  interphase  nuclei, 
we  are  still  a  long  way  from  fully  understanding  what  this  means  functionally  and  what 
molecules  are  responsible  for  positioning  and  repositioning  the  genome.  For  the 
purpose  of  this  review,  we  focus  on  mammalian  genomes,  but  many  properties  of 
genome  organization  hold  true  for  many  other  species,  including  plants,  yeast,  C. 
elegans  and  drosophila  (Bickmore  and  van  Steensel  2013;  Del  Prete  et  al.  2014; 
Egecioglu  and  Brickner  2011;  Sexton  et  al.  2012;  Sharma  and  Meister  2013;  Zimmer 
and  Fabre  2011). 

x.3  Non-random  genome  organization 

x.3.1  Spatial  separation  of  gene-rich  and  gene-poor  genomic  regions 
The  genome  is  highly  organized  within  interphase  nuclei.  Each  chromosome  and  gene 
occupies  a  preferred  nuclear  position  (fig  x.l)  (Boyle  et  al.  2001;  Cremer  and  Cremer 
2010;  Meaburn  and  Misteli  2007).  The  prototypical  example  of  this  is  human 
chromosomes  (FISA)  18  and  19  in  proliferating  cells.  The  gene-rich  FISA19  is  located  in 
the  center  of  the  nucleus,  while  the  gene-poor  FISA18  locates  to  the  nuclear  periphery 
(Boyle  et  al.  2001;  Croft  et  al.  1999).  Indeed,  in  proliferating  human  cells  the  radial 
position  (position  relative  to  the  edge  and  center  of  the  nucleus)  is  highly  correlated  with 
gene  density,  whereby  CTs  generally  follow  the  pattern  of  FISA18  and  19,  with  the 
gene-rich  chromosomes  in  the  center  of  the  nucleus  and  gene-poor  chromosome 
towards  the  periphery  (Boyle  et  al.  2001).  Chromosome  positioning  patterns  can  also  be 
correlated  with  chromosome  size  (Bolzer  et  al.  2005;  Bridger  et  al.  2000;  Sun  et  al. 
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2000).  In  this  case,  the  small  chromosomes  tend  to  position  to  the  nuclear  interior  and 
the  larger  chromosomes  prefer  the  periphery. 

Remarkably,  even  within  a  CT  there  is  spatial  separation  of  gene-rich  and  gene-poor 
DNA  (fig  x.2).  Using  FISH,  it  was  demonstrated  that  gene-rich  and  gene-poor  DNA 
within  a  7MB  domain  of  Drosophila  chromosome  2  (Boutanaev  et  al.  2005)  or  a  4.3MB 
domain  of  mouse  chromosome  (MMU)  14  (Shopland  et  al.  2006)  cluster  separately  from 
each  other.  This  holds  true  at  the  whole  CT  level.  The  exome  of  the  MMU2 
chromosome  is  not  found  evenly  throughout  its  CT,  and  instead  localizes  to  the  part  of 
the  CT  which  faces  the  nuclear  interior,  and  beyond  the  bulk  of  the  CT  (Boyle  et  al. 
2011).  In  the  case  of  the  HSAX  chromosome,  genes,  regardless  of  their  activity,  locate 
to  the  periphery  of  the  CT,  while  the  non-genic  DNA  localizes  to  the  interior  of  the 
territory  (Clemson  etal.  2006).  Genome-wide  analysis  has  shown  this  spatial  separation 
is  not  limited  to  a  few  isolated  cases.  Early  Hi-C  analysis,  at  the  resolution  of  1Mb, 
revealed  that  gene-rich  and  transcriptionally  active  chromatin  regions,  and  gene-rich 
chromosomes,  tend  to  cluster  together  within  the  nuclear  space,  as  do  gene-poor, 
inactive  chromatin  regions.  Moreover,  gene-poor  and  gene-rich  regions  are  far  less 
likely  to  cluster  with  each  other  (Lieberman-Aiden  et  al.  2009).  Hi-C  and  4C  have  also 
shown  that  intra-chromosomal  interactions  are  far  more  common  than  trans- 
chromosomal  interactions,  fitting  with  a  genome  organized  into  discrete  CTs,  and  even 
the  p  and  q  arms  of  the  same  chromosome  do  not  interact  at  high  frequency  (Bickmore 
and  van  Steensel  2013;  Lieberman-Aiden  etal.  2009).  FISH  data  also  reveals  that  CTs 
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are  not  completely  isolated  from  each  other.  For  example  by  cryo-F IS H ,  -40%  of  a 
given  CT  volume  intermingles  with  other  CTs  (Branco  and  Pombo  2006). 

At  a  higher  resolution,  Hi-C  and  5C  can  resolve  internal  CT  domains  further  and  has 
revealed  the  presence  of  the  so-called  topologically  associated  domains 
(TADsMBickmore  and  van  Steensel  2013;  Dixon  et  al.  2012).  TADs  are  discrete  kb-Mb- 
sized  domains,  which  are  defined  as  genomic  regions  that  are  more  likely  to  be  spatially 
close  to  other  loci  within  the  same  TAD  than  within  another  TAD  (Bickmore  and  van 
Steensel  2013;  Cavalli  and  Misteli  2013;  Dixon  et  al.  2012).  TADs  are  separated  by 
sharp  boundaries,  and  are  conserved  both  between  cell  types  and  species  (Dixon  et  al. 
2012).  TAD  borders  are  characterized  by  enrichment  for  certain  factors,  such  as  CTCF 
binding  sites,  housekeeping  genes  and  "active”  histone  modifications,  and  by  depletion 
of  heterochromatin  marks  such  as  FI3K9me3  (Dixon  et  al.  2012).  It  has  been  suggested 
that  the  boundaries  between  TADs  may  function  to  prevent  the  spread  of 
heterochromatin  (Dixon  et  al.  2012).  Despite  the  topological  constraints  imposed  by 
such  local  domains,  long  range  chromatin  interactions  still  occur  within  the  nucleus, 
particularly  between  promoters  and  distal  regulatory  elements  (Dekker  and  Misteli 
2015). 

Spatial  positioning  due  to  gene  density  and/or  size  cannot  be  the  full  story  however, 
since  the  positioning  patterns  of  chromosomes  are  not  identical  in  all  cells.  For  example, 
some,  but  not  all,  CTs  are  differentially  positioned  in  proliferating  and  non-proliferating 
cells,  as  indicated  by  FISA18  and  13  that  are  peripheral  in  proliferating  cells  but 
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internally  positioned  in  Go  cells  (Bridger  et  al.  2000;  Meaburn  and  Misteli  2008;  Mehta  et 
al.  2010).  Similarly,  although  the  organization  of  CTs  is  broadly  similar  across  different 
tissues,  some  chromosomes  occupy  alternative  positions  in  different  cell  types  (Boyle  et 
al.  2001;  Foster  et  al.  2012;  Parada  et  al.  2004;  Zuleger  et  al.  2013),  such  as  the  small 
and  gene-poor  HSA21  which  locates  to  the  nuclear  periphery  in  lymphoblasts  but  to  the 
nuclear  interior  in  fibroblasts  (Boyle  et  al.  2001).  Obviously,  gene  density  and 
chromosome  size  are  identical  between  these  different  cells,  indicating  other  factors, 
such  as  differential  gene  expression  or  histone  modifications,  in  CT  positioning. 

x.3.2  A  link  between  gene  activity  and  positioning 

Generally,  frequently  transcribed  genes  congregate  in  euchromatin,  or  "active 
chromatin",  and  localize  in  the  nuclear  interior,  while  repressed  genes  reside  in 
heterochromatin,  or  "inactive  chromatin",  and  localize  in  the  nuclear  periphery 
(Bickmore  2013).  At  the  individual  gene  level  there  is  also  a  correlation  with  gene 
expression  and  nuclear  position  (Ferrai  et  al.  2010a;  Takizawa  et  al.  2008b).  This  is 
strikingly  shown  by  the  differential  position  of  the  active  and  inactive  Gfap  and  114 
alleles,  where  the  inactive  allele  is  in  a  more  peripheral  position  than  the  active  allele,  at 
least  in  the  vast  majority  of  cells  in  the  population  (Takizawa  et  al.  2008a).  Another 
example  of  this  is  CFTR ,  which  moves  from  the  nuclear  periphery  upon  activation,  while 
its  neighbouring,  still  silent,  genes  remain  at  the  periphery  (Zink  et  al.  2004a).  In  fact 
many  loci,  particularly  developmental^  regulated  loci,  reposition  concomitantly  with 
activation  orsilencing  (Ferrai  etal.  2010a;  Takizawa  etal.  2008b).  For  example,  Igh  and 
Mashl  move  from  the  edge  of  the  nucleus  to  a  more  internal  position  as  they  become 
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activated  during  lymphocyte  and  neuronal  differentiation,  respectively  (Kosak  et  al. 
2002;  Williams  et  al.  2006).  DamID,  employing  a  lamin  B  fusion  gene,  found  that  the 
FISH  studies  on  a  limited  number  of  genes  coincided  well  at  the  genome-wide  scale.  In 
mammalian  genomes  there  are  1,100-1400  lamin  associated  domains  (LADs),  which 
are  predominantly  gene-poor  and  lowly  transcribed/silent  regions  of  the  genome  and 
include  many  developmental^  regulated  regions  (Guelen  et  al.  2008;  Peric-Hupkes  et 
al.  2010;  Pickersgill  et  al.  2006;  Zullo  et  al.  2012).  LADs  vary  between  cell  types  and 
differentiation  stages,  and  a  subset  of  genomic  regions  change  association  with  the 
lamina  as  they  change  expression  (Peric-Hupkes  et  al.  2010).  Interestingly,  numerous 
genes  move  away  from  the  periphery  during  differentiation  before  they  change 
expression  (Peric-Hupkes  et  al.  2010),  suggesting  that  repositioning  is  not  a  mere 
consequence  of  activity  changes  and  involves  separate  mechanisms.  While  the  majority 
of  genes  that  associate  with  another  lamina  protein,  lamin  A/C,  are  also  predominantly 
transcriptionally  silent  (Kubben  et  al.  2012),  these  observations  should  not  be  taken  to 
mean  that  all  silent  genes  locate  to  the  nuclear  periphery  or  that  all  peripheral  genes  are 
silent  (fig  x.2).  Many  silent  genes  are  located  within  the  interior  of  the  nucleus.  As  an 
example,  even  though  Gfap  moves  to  a  more  internal  location  upon  activation,  even  it  is 
inactive  state  it  does  not  associate  with  the  nuclear  periphery  (Takizawa  et  al.  2008a). 
Additionally,  within  each  individual  nucleus  only  -30%  of  the  LAD  domains  are  at  the 
nuclear  lamina  (Kind  et  al.  2013).  Conversely,  numerous  active  genes  locate  to  the 
nuclear  periphery,  often  in  association  with  nuclear  pore  complexes  (Egecioglu  and 
Brickner  2011),  which  may  provide  support  for  the  "gene-gating”  hypothesis  put  forward 
by  Gunter  Blobel  nearly  three  decades  ago  (Blobel  et  al.,  1985). 
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Many  gene  loci  change  their  association  with  other  nuclear  sub-compartments 
correlating  with  changes  in  transcription.  These  include  nuclear  bodies  such  as 
transcription  factories,  splicing  speckles,  Cajal  bodies,  polycomb  bodies  and  blocks  of 
heterochromatin  (reviewed  in  (Bickmore  and  van  Steensel  2013;  Ferrai  etal.  2010a)).  In 
at  least  some  cases,  this  association  is  critical  for  gene  expression  (Khanna  et  al. 
2014).  Several  examples  have  been  reported  of  clustering  of  co-regulated  genes,  either 
in  their  active  or  inactive  state,  within  the  nuclear  space,  in  many  cases  at  nuclear 
bodies  (Brown  et  al.  2008;  Clowney  et  al.  2012;  Fanucchi  et  al.  2013;  Hakim  et  al.  2011; 
Rieder  et  al.  2014;  Schoenfelder  et  al.  2010;  Szczerbal  and  Bridger  2010;  Takizawa  et 
al.  2008a).  Active  genes  may  even  loop  away  from  the  bulk  of  their  CT  in  order  to 
associate  with  co-regulated  genes  in  "transcription  factories”  or  splicing  speckles 
(Branco  and  Pombo  2006;  Brown  etal.  2006;  Ferrai  etal.  2010b;  Matarazzo  etal.  2007; 
Morey  et  al.  2009).  Splicing  speckles  are  enriched  in  components  of  the  splicing 
machinery  and  RNA  polymerases.  Transcription  factories  are  sites  of  active 
transcription  within  the  nucleus,  which  are  enriched  in  RNA  polymerases.  Individual 
transcription  factories  have  been  suggested  to  be  able  to  allow  the  simultaneous 
expression  of  several  genes  (Ferrai  etal.  2010a). 

x.3.3  Beyond  gene  expression 

Changes  in  the  spatial  position  of  a  gene  may  also  occur  in  the  absence  of  changes  in 
gene  expression  (Harewood  et  al.  2010;  Morey  et  al.  2009;  Kubben  et  al.  2012; 
Kumaran  and  Spector  2008;  Meaburn  and  Misteli  2008;  Takizawa  et  al.  2008b;  Williams 
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et  al.  2006).  For  example,  when  the  Mashl  locus  moves  to  the  nuclear  interior  upon 
activation  during  murine  neuronal  cell  differentiation,  neighbouring  genes  also  become 
more  internally  positioned  despite  remaining  silent  (Williams  et  al.  2006).  Equally,  when 
the  active  Hoxb  locus  loops  out  of  its  CT  during  mouse  ES  cell  differentiation,  flanking 
genes  also  move  away  from  the  bulk  of  the  CT  without  an  accompanying  change  in 
expression  (Morey  et  al.  2009).  Moreover,  when  loci  are  forced  to  change  nuclear 
position,  either  by  artificial  tethering  to  the  nuclear  periphery  or  by  chromosome 
translocation,  some  genes  do  alter  expression,  while  others  are  unaffected  (Finlan  etal. 
2008;  Flarewood  et  al.  2010;  Kumaran  and  Spector  2008;  Reddy  et  al.  2008). 
Conversely,  another  set  of  genes  remain  in  the  same  nuclear  position  when  their  activity 
changes  (Flakim  et  al.  2011;  Flewitt  et  al.  2004;  Meaburn  and  Misteli  2008),  such  as 
Ifn3 ,  which  remains  at  the  periphery  in  mouse  T  helper  cells  when  activated  (Flewitt  et 
al.  2004).  To  complicate  things  further,  in  mouse  erythroid  differentiation,  2 -globin 
becomes  active,  localizes  with  active  polymerase  II  and  then,  at  a  later  time,  moves 
from  the  nuclear  periphery  simultaneously  with  polymerase  II  foci  concentrating  away 
from  the  periphery  (Ragoczy  et  al.  2006).  In  this  case,  the  repositioning  could  be  a 
consequence  of  changes  in  expression.  Flowever,  gene  expression  increases  further 
after  the  loci  repositions,  suggesting  that  positioning  patterns  may  aid  modulation  of 
expression  (Ragoczy  et  al.  2006). 

One  explanation  for  these  varying  observations  on  individual  genes  is  that  some  loci 
require  specific  positioning  for  expression  and  others  do  not,  and  that  the  latter  loci  are 
passively  moved  along  with  the  neighbouring  regions  for  which  positioning  is  important. 
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Alternatively,  other  factors  may  also  influence  spatial  positioning.  It  seems  the  genomic 
or  regional  neighbourhood  a  locus  resides  in  is  important  for  its  behaviour  (Takizawa  et 
al.  2008b).  In  agreement  with  this,  the  same  gene  can  behave  differently  between 
species  when  activated.  For  example,  during  erythroblast  differentiation  ±-globin  and  2- 
globin  loci  co-localize,  with  ±-globin  looping  out  of  its  CT  in  mice  when  the  gene 
activates,  but  neither  of  these  events  occur  in  humans  (Brown  et  al.  2008;  Brown  et  al. 
2006).  The  genomic  neighbourhood  for  these  two  genes  (e.g.  local  gene  density)  is 
divergent  between  mouse  and  human  (Brown  et  al.  2008).  These  data  suggest  that  the 
repositioning  and  resulting  proximity  of  these  two  genes  is  not  a  requirement  per  se  for 
the  regulation  of  particular  genes,  and  that  genomic  context  matters.  Further  evidence 
that  sequence  does  not  intrinsically  dictate  position  but  that  the  local  environment  plays 
a  role  comes  from  humammouse  hybrid  cells.  In  these  hybrid  cells,  many  human 
chromosomes  do  not  assume  the  same  position  in  the  mouse  nucleus  as  they  do  in  the 
human  cells  they  originate  from  (Meaburn  et  al.  2008).  Such  comparisons  across 
multiple  species  and  assessing  the  effects  of  manipulating  the  nuclear  environment  may 
shed  light  on  the  factors  that  are  important  in  defining  positioning  patterns. 

Transcription  is  only  one  of  many  functions  the  nucleus  carries  out.  While  gene  activity 
does  not  fully  account  for  genome  organization,  it  could  be  that  other  nuclear  activities, 
either  alone  or  in  combination  with  gene  expression,  influence  positioning  patterns.  For 
example,  replication  timing,  which  in  itself  is  linked  to  gene  expression,  correlates  with 
nuclear  positioning  patterns  (Meaburn  et  al.  2008).  Moreover,  in  yeast  cells,  broken 
chromosome  ends  move  to  the  nuclear  periphery  (Nagai  et  al.  2008).  Although  in 
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mammalian  cells  the  position  of  broken  DNA  ends  are  generally  positionally  stable 
(Roukos  and  Misteli  2014),  large  scale  DNA  damage  does  seem  to  induce  both 
decondensation  of  chromatin  (Kruhlak  et  a I.,  2006;  Dellaire  et  a I.,  2009)  and  a  re¬ 
organization  of  the  genome  (Mehta  et  al.  2013).  For  example  both  UV-laser  induced 
DNA  damage  (Kruhlak  et  al.,  2006)  and  etoposide-induced  DNA  DSBs  (Dellaire  et  al., 
2009)  can  trigger  large  scale  decondensation  of  chromatin.  In  another  example,  five 
chromosomes  were  shown  to  reposition  in  fibroblasts  cells  upon  treatment  with  global 
DNA  damaging  agents,  including  gene-rich  H S A17,  19  and  20  relocating  to  the  nuclear 
periphery  (Mehta  et  al.  2013). 

Chromosome  organization  may  also  play  a  structural  role  by  contributing  to  the  function 
of  the  organ  within  which  the  individual  cells  reside.  For  example,  a  rather  surprising 
apparent  role  for  genome  organization  with  respect  to  cellular  function  has  been 
identified  in  the  retina.  In  rod  cells  of  nocturnal  mammals  there  is  an  inversion  of  the 
position  ofeu-  and  heterochromatin,  with  heterochromatin  locating  in  the  center  of  cells 
(Solovei  etal.  2009).  This  evolutionary  adaptation  is  important  for  filtering  low  levels  of 
light  through  nuclei  to  the  photoreceptors  to  optimize  night  vision  (Solovei  et  al.  2009). 
Surprisingly,  this  massive  reorganization  of  chromatin  does  not  affect  transcriptional 
output  (Solovei  et  al.  2009). 

Despite  the  ample  evidence  of  the  existence  of  chromosomal  movement,  it  remains 
unclear  what  mechanisms  move  chromatin  in  the  nucleus.  Several  mechanisms  have 
been  suggested.  For  example,  passage  though  mitosis  and  particularly  early  Gi  has 
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been  associated  with  localizing  some  regions  to  the  nuclear  periphery,  such  as  HSA18 
as  cells  re-enter  proliferation  (Bridger  et  al.  2000).  Similarly,  the  subset  of  LAD  regions 
binding  to  the  nuclear  periphery  are  reset  during  early  Gi,  and  are  a  different  subset 
than  the  mother  cell  (Kind  et  al.  2013).  Moreover,  tethering  of  a  reporter  array  to  the 
periphery  via  lamin  B  also  required  passage  through  mitosis  (Kumaran  and  Spector 
2008).  However,  the  observed  repositioning  of  loci  in  quiescence  (Bridger  et  al.  2000; 
Meaburn  and  Misteli  2008),  by  its  very  definition,  cannot  involve  mitosis  and  early  Gi.  In 
fact,  the  repositioning  of  whole  chromosomes  in  quiescent  cells  is  remarkably  quick, 
occurring  within  15  min  after  cells  are  placed  into  low  serum  conditions,  and  requires 
ATP/GTP  and  nuclear  myosin  and  actin  (Mehta  etal.  2010).  Chromatin  modifiers  may 
also  play  a  role  is  the  positioning  of  the  genome.  In  line  with  this,  histone  deacetylase  3 
is  important  for  the  correct  peripheral  localization  of  certain  gene  loci  (Demmerle  et  al. 
2013;  Zullo  et  al.  2012).  Similarly,  histone  H3K9  di-  and  tri-methylation  by  Suv39H  and 
G9a  in  mammalian  cells  (Bian  etal.  2013;  Kind  etal.  2013)  and  by  MET-2  and  SET-25 
in  C.  elegans  (Towbin  et  al.  2012)  are  also  important  to  anchor  heterochromatin  at  the 
nuclear  periphery.  Non-coding  RNA  may  also  play  a  role  in  maintaining  certain  features 
of  CT  since  it  has  recently  been  discovered  that  non-coding  RNA  from  repetitive  regions 
of  the  DNA  coatCTs  and  appear  to  influence  chromatin  compaction  (Hall  etal.  2014).  In 
addition,  the  long  noncoding  RNA  Firre  is  necessary  for  the  co-localization  of  specific 
trans-chromosomal  loci  (Hacisuleyman  et  al.  2014).  The  potential  role  of  the  nuclear 
envelope  in  positioning  the  genome  will  be  discussed  below  (Section  x.4.1). 


x.4  Genome  organization  and  disease 
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The  fact  that  genome  organization  is  linked  to  correct  cellular  function  is  highlighted  by 
the  findings  that  the  genome  is  often  reorganized  in  disease  states.  Disease  generally 
does  not  result  in  global  nuclear  re-organization  and  instead  leads  to  repositioning  of 
subsets  of  genome  regions.  Studying  genome  positioning  patterns  in  disease  is  giving 
deeper  insight  into  what  regulates  genome  organization,  may  provide  clues  to  disease 
mechanisms  and  is  paving  the  way  for  spatial  positioning  to  be  used  as  a  diagnostic 
tool. 

x.4.1  The  nuclear  envelope,  laminopathies  and  genome  organization 
Laminopathies  are  a  group  of  rare  diseases  caused  by  mutations  in  nuclear  envelope 
(NE)  proteins.  The  most  prominent  of  these  diseases  result  from  mutations  in  lamin  A/C 
and  include  Emery-Dreifuss  muscular  dystrophy  (EDMD)  and  the  premature  aging 
disease  H utchison-G ilford  progeria  syndrome  (HGPS)  (Burke  and  Stewart 2013;  Dittmer 
and  Misteli  2011).  The  organization  of  CTs  in  proliferating  fibroblasts  from  laminopathy 
patients  resembles  that  of  quiescent  normal  fibroblasts  cells,  with  HSA13  and  18  no 
longer  positioned  at  the  nuclear  periphery  and  HSA10  relocating  to  the  periphery 
(Meaburn  et  al.  2007b;  Mehta  et  al.  2011;  Mewborn  et  al.  2010).  These  altered 
positioning  patterns  can  be  reversed.  Treatment  of  HGPS  fibroblasts  with 
farnesyltransferase  inhibitors  (FTIs),  which  prevent  the  accumulation  of  mutant  lamin  A 
at  the  NE,  rescues  CT  positioning  back  to  normal  (Mehta  etal.  2011).  Interestingly,  the 
reorganization  of  CTs  is  similar  for  most  laminopathy  patients,  irrespective  of  where  the 
mutation  maps  to  on  the  lamin  A/C  gene  (LMNA),  or  what  disease  the  patient  suffers 
from  (Meaburn  et  al.  2007a;  Mewborn  et  al.  2010).  The  exceptions  to  this  being  LMNA 
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mutations  delta303  and  D596N,  where  HSA13  is  more  tightly  associated  with  the 
nuclear  periphery  (Mewborn  et  al.  2010).  Not  all  chromosomes  change  their  association 
with  the  NE  in  laminopathy  fibroblasts,  however.  Similar  to  quiescent  control  cells, 
HSA4  and  X  remain  at  the  nuclear  periphery  of  laminopathy  cells  (Meaburn  et  al. 
2007a;  Mehta  et  al.  2011).  That  said,  the  conformation  of  each  chromosome  does 
appear  to  change,  at  least  in  late  passage  HGPS  cells.  Using  Hi-C,  there  is  a  loss  of 
spatial  separation  between  active  and  inactive  chromosomal  domains  in  late  passage, 
senescing  HGPS  fibroblasts  (passage-19),  but  not  in  a  proliferating  culture  of  HGPS 
(passage-17)  (McCord  et  al.  2013).  Since  senescing  normal  fibroblasts  were  not  used 
as  a  control,  it  remains  unclear  to  what  extent  these  changes  are  associated  with 
senescence  and  what  changes  are  directly  linked  to  HGPS.  Interestingly,  changes  in 
H3K27me3  and  lamin  A/C  binding  patterns  were  noted  to  occur  before  the  large  scale 
changes  in  chromatin  contact  frequencies  in  these  cells,  suggesting  these  are  important 
determinants  of  chromatin  organization  (McCord  etal.  2013). 

The  observations  in  laminopathies  are  in  line  with  findings  on  the  role  of  various  NE 
proteins  in  genome  organization.  In  mouse  fibroblasts,  the  position  of  MMU18  to  the 
nuclear  periphery  is  dependent  on  lamin  Bl,  while  the  central  position  of  MMU19  is  not 
(Malhas  et  al.  2007).  As  with  lamins,  several  NE  transmembrane  proteins  (NETs) 
influence  the  position  of  a  specific  subsets  of  chromosomes  (Zuleger  et  al.  2013). 
NET29  and  NET39  have  a  role  in  locating  HSA5  and  HSA13  to  the  nuclear  periphery, 
whereas  expression  of  NET5,  NET45  and  NET47  position  HSA5  but  not  HSA13  to  the 
periphery.  Moreover,  the  presence  of  NET47  actually  reduced  HSA13's  association  with 
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the  nuclear  periphery.  On  the  other  hand,  the  internal  position  of  HSA19  and  HSA17  are 
not  influenced  by  presence  of  these  NETs  (Zuleger  et  al.  2013). 

Given  that  the  NE  of  different  cell  types  is  composed  of  distinct  complements  of  NE 
proteins,  the  variations  between  different  NE  proteins  and  the  chromosomes  they 
influences  may  account,  in  part,  for  the  tissue-specificity  of  genome  organization, 
(Solovei  et  al.  2013;  Wong  et  al.  2014).  For  example,  the  expression  of  some  NETs, 
including  the  five  NETs  mentioned  above,  are  either  restricted  to  only  certain  tissues  or 
exhibit  a  wide  range  of  expression  levels  between  tissue  types  (Zuleger  et  al.  2013).  In 
keeping  with  this,  the  positioning  pattern  of  HSA5  in  liver  cells  could  be  transformed  into 
that  of  kidney  cells  by  modifying  the  expression  of  NETs  so  that  the  NE  better 
resembled  the  kidney  NE  (Zuleger  et  al.  2013).  The  differential  influence  of  various  NE 
proteins  between  different  cell  types  is  also  highlighted  in  laminopathies.  H S A13  and  18 
remain  peripheral  in  laminopathy  patient  lymphoblastiod  cell  lines,  a  cell  type  which 
does  not  require  lamin  A/C  (Boyle  et  al.  2001;  Meaburn  et  al.  2005).  Consistent  with 
this,  in  C.  elegans  an  EDMD  associated  mutant  lamin  (Y59C)  inhibited  the  release  from 
the  nuclear  lamina,  and  full  activation,  of  a  muscle  promoter  array  in  muscle  cells,  but  it 
did  not  interfere  with  an  intestinal  promoter  containing  array  relocating  to  the  nuclear 
interior  during  gut  development  (Mattout  et  al.  2011).  Along  with  the  tissue  specific 
impairment  of  genome  reorganization,  the  Y59C  worms  only  had  an  aberrant  muscle 
phenotype  (Mattout  etal.  2011).  In  the  most  detailed  study  to  date  of  differences  in  NE 
composition  between  tissues  and  the  consequences  on  spatial  organization,  NE 
proteins  lamin  B  receptor  (LBR)  and  lamin  A/C  were  compared  between  rod  nuclei  from 
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39  different  species  and  in  30  tissue  types  during  mouse  development  (Solovei  et  al. 
2013).  Lack  of  both  LBR  and  lamin  A/C  strongly  correlated  with  a  dramatic  inversion  of 
chromatin  distribution,  in  which  euchromatin  shifts  from  an  internal  position  to  the 
periphery.  In  wild-type  and  LBR  null  mice  inverted  chromatin  was  only  observed  in  cell 
types  lacking  both  LBR  and  lamin  A/C,  suggesting  that  these  proteins  have  roles  in 
targeting  heterochromatin  to  the  nuclear  periphery  (Solovei  et  al.  2013).  Indeed,  Pelger- 
Huet  anomaly,  a  disease  in  which  LBR  is  mutated,  is  characterised  by  an  altered 
heterochromatin  distribution  (Hoffmann  et  al.  2007).  Developmental^  regulated  loss  of 
LBR  has  also  been  implicated  in  the  clustering  of  silenced  olfactory  receptor  genes, 
away  from  the  nuclear  periphery,  in  olfactory  neurons  (Clowney  et  al.  2012). 

The  changes  to  genome  organization  in  cells  with  either  a  mutation  in  a  NE  protein  or 
with  an  altered  complement  of  NE  proteins  may  be  due  to  a  direct  role  of  NE  proteins  in 
tethering  chromatin.  However,  since  these  reorganizations  occur  in  concert  with 
misregulation  of  both  gene  expression  and  histone  modifications  (Malhas  et  al.  2007; 
McCord  et  al.  2013;  Mewborn  et  al.  2010;  Scaffidi  and  Misteli  2005;  Shumaker  et  al. 
2006),  it  cannot  be  ruled  out  that  that  the  repositioning  events  are  indirectly  related  to 
altered  NE  structure  or  function.  Of  course,  direct  roles  via  physical  interaction  and 
indirect  roles  of  NE  proteins  in  chromatin  positioning  are  not  mutually  exclusive.  It 
seems  likely  that  genome  repositioning  events  are  a  result  of  a  release  from  tethering  to 
the  NE  as  well  as  due  to  changes  in  epigenetic  modifications  and  gene  expression  that 
stem  from  alterations  in  NE  interactions  with  other  proteins.  Several  lines  of  evidence 
support  a  direct  tethering  role  for  NE  proteins.  These  include  the  observation  that  NE 
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proteins,  such  as  lamins  and  LBR,  can  directly  bind  to  DNA  and  chromatin  (Dittmer  and 
Misteli  2011;  Makatsori  et  al.  2004).  Moreover,  HSA13  and  18  are  mis-positioned  in 
asymptomatic  carriers  of  LMNA  mutation  R527H+/-  (Meaburn  et  al.  2007a),  where 
presumably  gene  expression  patterns  and  chromatin  modifications  are  similar  to  control 
cells.  In  LMNA  E145K  mutant  fibroblasts  centromeres  and  telomeres  are  mis-localized 
only  in  nuclei  that  are  lobulated  (Taimen  et  al.  2009).  Live  cell  imaging  revealed  that  the 
reorganization  of  the  centromeres  and  telomeres  and  lobulation  occurred  together  as 
the  NE  reformed  after  mitosis,  suggesting  an  altered  attachment  of  centromeres  and 
telomeres  to  lamins  (Taimen  et  al.  2009).  In  further  support  of  a  direct  tethering  role,  in 
mouse  cardiac  myocytes,  knock-down  of  lamin  A/C  caused  a  subset  of  genes  to  move 
away  from  the  nuclear  periphery,  in  the  absence  of  a  change  of  expression  for  that  gene 
(Kubben  et  al.  2012).  It  may  be  that  several  NE  proteins  are  required  to  work  in  concert 
to  tether  chromatin  to  the  NE.  At  least  in  rod  cells,  LEM  domain  proteins,  such  as 
emerin,  are  required  with  lamin  A/C  to  tether  chromatin  to  the  periphery  (Solovei  et  al. 
2013).  Conversely,  LBR  does  not  seem  to  require  a  mediator  to  tether  heterochromatin 
(Solovei  et  al.  2013). 

x.4.2  Altered  genome  organization  in  other  non-cancerous  diseases 
Genome  reorganization  is  not  limited  to  diseases  with  altered  NE.  In  one  of  the  first 
studies  of  spatial  genome  organization  in  disease,  Borden  and  Manuelidis  established 
that  the  centromere  of  HSAX  is  relocated  away  from  the  nuclear  periphery  or  the 
periphery  of  nucleoli  in  the  epileptic  focus  of  brain  tissue  from  patients  with  chronic 
uncontrolled  seizures.  Conversely,  loci  mapping  to  lq  12,  9q  12  and  Yql2  remained 
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proximal  to  the  NE  and/or  nucleolus  in  most  patients  (Borden  and  Manuelidis  1988). 
Some  diseases  also  have  an  increased  clustering  of  genomic  loci.  In  cheek  cells  from 
Alzheimer's  patients  there  is  an  altered  telomere  aggregation  and  clustering,  possibly  as 
the  result  of  shortened  telomeres  and  telomere  dysfunction  (Mathur  et  al.  2014)  and  in 
Down  syndrome  there  is  an  increase  clustering  of  HSA21  (Paz  etal.  2013). 

Furthering  a  link  between  spatial  genome  positioning  and  histone  modifications,  gene 
repositioning  occurs  in  a  disease  associated  with  altered  DNA  methylation.  Mutations  in 
DNA  methyltransferase  3B  lead  to  immunodeficiency  centromeric  instability  facial 
anomalies  (ICF)  syndrome,  and  several  genomic  loci  are  subjected  to  DNA 
hypomethylation  and  thus  activation.  One  such  site,  SYBL1,  which  normally  escapes  X 
inactivation,  loops  out  of  the  inactive  X  (Xj)  in  female  ICF  cells  and  the  FISAY  CT  in  male 
ICF  cells,  but  it  does  not  loop  from  the  active  X  (Matarazzo  et  al.  2007).  The  normally 
methylated  and  silenced  neighbouring  gene  SPRY3  also  loops  out  of  the  Xj  CT,  but  not 
FISAY  in  ICF  cells  (Matarazzo  et  al.  2007).  Both  these  differences  point  to  the  local 
genomic  environment  as  an  influential  factor  on  positioning  (see  section  x.3.3). 

Altered  spatial  positioning  patterns  are  also  induced  in  cells  infected  with  either  viruses 
or  parasites.  For  example,  Epstein-Barr  virus  infection  of  lymphocytes  results  in  FISA17 
transiently  moving  closer  to  the  periphery  in  the  days  following  infection,  while  the 
position  of  FISA18  is  unaffected  (Li  et  al.  2010).  Similarly,  in  snail  Bge  embryonic  cells, 
infection  with  the  parasite  Shistosoma  manoni  results  in  temporal  repositioning  of  gene 
loci  (Knight  et  al.  2011).  Again,  the  effect  of  infection  varies  between  the  loci  studied. 
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Actin  shifted  toward  the  nuclear  periphery  within  30  minutes  of  infection,  in  the  absence 
of  a  change  in  gene  expression.  However,  in  these  cells,  gene  expression  was  altered 
at  both  earlier  (15min)  and  later  (2  hours)  times  after  infection.  Conversely,  ferritin  was 
displaced  from  the  nuclear  periphery,  peaking  at  5  hours  and  returning  to  a  peripheral 
positioning  by  24  hours,  matching  the  temporal  changes  in  its  expression.  This 
repositioning  is  dependent  on  an  active  infection,  and  not  simply  the  presence  of  a 
cellular  breach  or  foreign  entity  within  the  nuclei  since  irradiated  (non-functional) 
parasites  did  not  induce  repositioning  (Knight  et  al.  2011).  Interestingly,  both  HSA17 
and  actin  exhibited  cycling  of  positioning  to  the  periphery  after  infection,  with  both 
returning  to  the  periphery  again  after  internal  positioning  had  been  restored  (Knight  et 
al.  2011;  Li  etal.  2010). 

x. 4. 3  Altered  genome  organization  in  cancer 

Alterations  in  spatial  genome  organization  have  been  prominently  linked  to  cancer 
(Meaburn  and  Misteli  2007;  Zink  et  al.  2004b).  Indeed,  distinctive  alterations  in 
chromatin  staining  patterns  and  nucleoli  size  and  number,  in  addition  to  nuclear  shape 
and  tissue  morphology,  are  used  by  pathologists  to  diagnose  cancer.  Yet  these  large 
scale  changes  at  the  chromatin  level  do  not  appear  to  reflect  a  global  change  in 
genomic  spatial  positioning  patterns.  For  example,  in  pancreatic  cancer  tissues,  HSA8 
remains  atthe  nuclear  peripherally,  although  the  CT  shape  changes  (Timme  et  al.  2011; 
Wiech  et  al.  2005).  Conversely,  HSA18  and  19  change  nuclear  location  in  several 
cancers  types,  including  cervical,  colon  and  some  thyroid  cancers  (Cremer  et  al.  2003; 
Murata  et  al.  2007).  Cancer  related  repositioning  is  not  limited  to  whole  chromosomes. 
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For  instance,  in  cervical  squamous  carcinoma,  the  BCL2  locus  repositions  in  a  BCL2 
positive  tumor,  but  not  in  a  BCL2  negative  tumor  (Wiech  et  al.  2009).  The  most 
extensively  studied  cancer  to  date,  with  respect  to  spatial  genome  positioning,  is  breast 
cancer.  4C  data,  and  FISH  validation,  revealed  IGFBP3  changes  long  range  interaction 
partners  in  breast  cancer  cell  lines  (Zeitz  et  al.  2013).  Moreover,  the  centromere  of 
H S A17  is  more  internally  positioned  in  breast  cancer  (Wiech  et  al.  2005)  and  several 
genes  reposition  in  breast  cancer  (Meaburn  et  al.  2009;  Meaburn  and  Misteli  2008).  In 
an  in  vitro  mammary  epithelial  cell  model  of  early  breast  cancer,  four  out  of  eleven 
tested  genes  ( AKT1 ,  VEGF,  BCL2  and  ERBB2 )  significantly  changed  intranuclear 
position,  but  not  their  gene  expression  level,  during  carcinogenic  transformation 
(Meaburn  and  Misteli  2008).  Similarly,  in  breast  cancer  tissues,  eight  of  20  tested  genes 
(J HE55 ,  HSP90AA1,  TGFB3,  MYC,  ERBB2,  FOSL2,  CSF1R  and  AKT1)  occupied 
significantly  different  positions  in  breast  cancer  tissues  compared  to  normal  tissues 
(Meaburn  et  al.  2009).  These  differences  were  not  due  to  inter-sample  variance  since 
these  genes  were  positioned  in  11-14  cancers  and  6-9  normal  tissues  and  even  though 
a  wide  range  of  breast  cancers  specimens  were  used,  these  genes  repositioned  in  64.3- 
100%  of  cancers  (MEABURN  09).  The  differences  in  gene  position  were  also  not  due  to 
numerical  chromosome  abnormalities,  and,  for  most  genes,  were  not  observed  in  the 
benign  breast  diseases  hyperplasia  and  fibroadenoma  or  among  normal  tissues 
(Meaburn  et  al.  2009).  These  observations  point  to  specific  repositioning  events  of  a 
subset  of  genes  in  breast  cancer  and  they  point  to  the  possibility  of  using  spatial 
positioning  of  the  genome  as  a  diagnostic  biomarker  for  cancer  detection  (Meaburn  et 
al.  2009;  Meaburn  and  Misteli  2008). 
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X.5  The  role  of  positioning  and  chromatin  in  translocation  formation 

X.5.1  Spatial  proximity  of  translocation  partners 

While  it  is  not  entirely  clear  why  the  genome  is  positioned  as  it  is  within  interphase  cell 
and  what  functional  role  it  has,  the  spatial  positioning  of  the  genome  has  emerged  as  an 
important  factor  in  determining  chromosome  translocation  partners  (fig  x.3)(Meaburn  et 
al.  2007b;  Misteli  2010;  Roukos  and  Misteli  2014).  It  was  initially  noticed  in  FISH 
experiments  that  frequent  translocation  partners  appeared  to  be  frequently  found  in 
close  spatial  proximity.  For  example,  peripheral  chromosomes  HSA4,  9,  13  and  18  are 
more  likely  to  translocate  with  each  other  than  with  internally  located  chromosomes 
(Bickmore  and  Teague  2002).  Moreover,  chromosomes  that  form  preferred  clusters 
within  certain  tissues,  such  as  MMU12,  14  and  15  in  splenocytes  or  MMU5  and  6  in 
hepatocytes,  are  more  likely  to  translocate  in  cancers  derived  from  those  tissues  than 
chromosomes  that  are  not  part  of  these  clusters  (Parada  et  al.  2004;  Parada  et  al. 
2002).  This  data  points  to  the  role  tissue  specific  genome  organization  may  play  in  the 
prevalence  of  certain  translocations  in  different  tissues.  In  keeping  with  proximal 
chromosomes  being  more  likely  to  be  translocation  partners,  the  amount  of 
intermingling  with  neighbouring  chromosomes  in  human  lymphocytes  correlates  with 
translocation  frequency  (Branco  and  Pombo  2006). 

Perhaps  more  importantly,  beyond  the  level  of  entire  chromosomes,  genes  that 
translocate  are  in  close  proximity,  in  the  cell  types  where  they  translocate  (Lukasova  et 
al.  1997;  Meaburn  et  al.  2007b;  Misteli  2010;  Neves  et  al.  1999;  Roix  et  al.  2003; 
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Roukos  and  Misteli  2014).  For  instance,  in  androgen  deprived  prostate  cell  nuclei, 
TMPRSS2,  ERG  and  ETV1,  which  are  common  translocation  partners  in  prostate 
cancers,  are  generally  found  in  distant  locations  (Lin  et  al.  2009;  Mani  et  al.  2009).  Upon 
androgen  stimulation  these  genes  reposition  to  become  proximal  neighbours, 
predisposing  them  as  translocation  partners,  as  indicated  by  a  dramatic  increase  in 
translocation  frequency  upon  irradiation  (Lin  et  al.  2009;  Mani  et  al.  2009).  Along  the 
same  lines,  in  anaplastic  large-cell  lymphoma  cells  the  close  proximity  of  ALK  and 
NPM1  facilities  the  formation  of  the  ALK-NPM1  gene  fusion  upon  irradiation  (Mathas  et 
al.  2009). 

The  caveat  of  these  correlation  studies  is  they  may  not  reflect  translocation  formation 
per  se,  since  only  translocations  that  carry  a  growth  advantage,  and  thus  expand  to  high 
levels  within  the  cancer  cell  population,  are  analyzed.  Moreover,  most  of  these  studies 
were  limited  to  a  few  genes  and  control  regions.  To  address  this,  several  genome-wide 
studies  have  been  performed  in  mouse  B  lymphocytes,  which,  crucially,  were  carried 
out  before  cellular  selection  skewed  the  translocation  detection  frequency. 
Translocation  frequencies  across  the  genome  were  measured  and  mapped  onto  linear 
chromosomes  (Chiarle  et  al.  2011;  Klein  et  al.  2011)  or  compared  to  Hi-C  (Zhang  et  al. 
2012)  or  4C  (Hakim  et  al.  2012;  Rocha  et  al.  2012)  data,  to  account  for  3D  genomic 
proximity.  These  studies  have  confirmed  a  correlation  between  spatial  proximity  and 
translocation  frequency.  Nevertheless,  a  low  frequency  of  translocations  from  genes 
that  were  distally  located  was  also  detected  (Zhang  et  al.  2012).  In  addition  to  spatial 
proximity,  not  surprisingly,  the  amount  of  DNA  damage  was  also  implicated  in 
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determining  frequency  of  specific  translocations  (Hakim  et  al.  2012).  These  correlative 
studies  were  recently  extended  by  direct  observation  of  translocation  formation  (Roukos 
et  al.  2013).  For  this,  inducible  DNA  double  strand  break  (DSB)  sites,  tagged  with 
coloured  fluorescently  labels,  were  integrated  into  different  mouse  chromosomes.  Upon 
DNA  damage,  the  broken  DNA  ends  where  then  tracked  using  live-cell  imaging  of 
individual  cells.  Consistent  with  both  the  FISH  and  genome-wide  data,  the  vast  majority 
of  translocation  forming  breaks  were  proximal  (within  2.5pm)  before  pairing,  however, 
-10%  of  translocations  were  formed  from  distant  breaks  (>4pm  apart)  (Roukos  et  al. 
2013). 

Given  the  fact  the  translocation  partners  tend  to  be  proximal  neighbours  before 
translocation,  it  is  not  surprising  that  many  fusion  chromosomes  resulting  from  the 
translocation  event  position  similarly  to  their  intact  counterparts  (Cremer  et  al.  2003; 
Croft  et  al.  1999;  Meaburn  et  al.  2007b;  Parada  et  al.  2002).  Interestingly,  the 
orientations  within  the  derivative  CT  tend  to  reflect  the  positioning  patterns  of  the 
individual  intact  chromosomes  (Cremer  et  al.  2003;  Croft  et  al.  1999),  for  example, 
within  the  CT  of  the  t(18;  19)  derivative  chromosome,  the  HSA18  DNA  is  more 
peripherally  located  than  the  HSA19  DNA,  similar  to  the  intact  chromosomes  (Croft  et 
al.  1999).  This  is  not  to  say  the  positions  of  all  translocation  chromosomes  are 
unaffected.  In  lymphoblastoid  cells  from  multiple  individuals  with  the  t(ll;22)(q23;qll) 
balanced  translocation,  the  HSA11  DNA  on  the  fusion  chromosome  is  more  centrally 
located  that  intact  HSA11.  Similarly,  the  HSA22  portion  of  the  fusion  chromosome  was 
more  peripherally  located  than  the  intact  HSA22  for  the  derivative  11  chromosome,  but 


26  |  P  a  g  e 


THE  FUNCTIONAL  NUCLEUS  (Eds  Dellaire,  Bazett-Jones,  Springer  DE) 


was  unaffected  when  it  was  part  of  the  derivate  22  chromosome  (Harewood  et  al. 
2010).  The  repositioning  of  gene  loci  after  translocation  can  also  be  variable  depending 
on  the  translocation,  with  some  translocated  loci  repositioning  and  others  not(Meaburn 
et  al.  2007b).  In  at  least  some  case  the  repositioning  reflects  alterations  to  the  local 
gene  density  around  the  fusion  gene  site  (Harewood  et  al.  2010;  Murmann  et  al.  2005) 
or  aberrant  expression  at  or  around  the  fusion  gene  (Ballabio  et  al.  2009;  Harewood  et 
al.  2010). 

x.5.2  Chromatin  organization  and  translocations 

While  the  spatial  arrangement  of  the  genome  in  vivo  contributes  to  the  formation  of 
translocations,  not  all  genomic  loci  are  equally  susceptible  to  translocation.  Recently, 
evidence  has  emerged  that  points  to  higher-order  chromatin  structure  as  a  key  player  in 
translocations  formation,  possibly  by  modulating  DNA  DSB  susceptibility  and  repair 
(Misteli  2010).  In  support,  genome-wide  sequencing  of  translocation  junctions  after 
DSBs  were  experimentally  induced  found  that  most  breakpoints  localize  within  or  near 
transcriptionally  active  regions  (fig.  x.3)  (Chiarle  et  al.  2011;  Klein  et  al.  2011).  The 
break  sites  were  enriched  for  histone  modifications  associated  with  active  chromatin, 
such  as  H3K4me3,  H3K36me3  and  H3  acetylation  (Klein  et  al.  2011).  A  link  between 
transcriptional  activity  and  translocation  frequency  has  been  demonstrated  in  prostate 
cancer,  where  the  common  translocation  fusion-gene  partners  TMPRSS2,  ERG,  and 
ETV1  contain  binding  sites  for  androgen  receptor  (AR),  a  potent  transcriptional 
activator.  Upon  androgen  treatment,  AR  was  co-recruited  with  topoisomerase-ll2  to 
break  sites,  leading  to  a  more  open  chromatin  conformation  and  persistent  DSBs 
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(Haffner  et  al.  2010;  Lin  et  al.  2009).  Similarly,  the  regions  near  translocation 
breakpoints  in  anaplastic  large  cell  lymphoma  were  found  to  be  transcriptionally 
activated  prior  to  translocation  (Mathas  etal.  2009).  Taken  together,  these  observations 
suggest  that  genomic  regions  with  altered  chromatin  structure  and  transcription  factor 
binding  may  be  more  susceptible  to  DSBs  that  lead  to  translocations. 

Chromatin  organization  within  the  nucleus  also  appears  to  determine  the  efficiency  of 
DNA  repair.  After  a  DSB  occurs,  damaged  chromatin  around  the  break  is  thought  to 
rapidly  decondense  to  facilitate  access  of  repair  machineries,  and  then  recondense  as 
the  repair  process  progresses.  These  events  are  orchestrated  by  chromatin  remodelers 
that  reposition  nucleosomes,  histone  chaperone  proteins  that  exchange  core  histones 
for  specific  histone  variants,  and  histone  modifying  enzymes  (Groth  etal.  2007).  Higher- 
order  chromatin  structure  can  drastically  influence  the  progression  of  repair,  possibly  by 
impeding  recruitment  of  these  proteins.  For  example,  radiation-induced  DSBs  in 
heterochromatin  were  observed  to  repair  more  slowly  than  breaks  in  euchromatin 
(Goodarzi  et  al.  2008).  In  the  case  of  translocations,  several  chromatin  modifications 
have  been  implicated  in  the  inaccurate  repair  of  DSBs.  In  prostate  cancer  cells  treated 
with  liganded  AR,  H3K79me2,  a  modification  associated  with  DNA  recombination,  was 
found  to  be  enriched  near  TMPRSS2  and  ERG  breakpoints.  Overexpression  of  H3K79- 
specific  methyltransferase  DOTH  significantly  increased  translocation  frequency  (Lin  et 
al.  2009).  Along  the  same  lines,  genome-wide  conversion  to  an  H4K20 
monomethylation  state  in  mice  led  to  defective  DSB  repair,  Ig  class-switch 
recombination,  and  IgH  translocations  (Schotta  etal.  2008).  Finally,  in  the  absence  of 
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H2AX,  a  histone  variant  that  is  immediately  phosphorylated  after  DSB  formation,  DSBs 
were  shown  to  persist  during  Ig  class-switch  recombination,  resulting  in  frequent 
translocations  (Franco  etal.  2006).  These  observations  point  to  a  potential,  still  poorly 
characterized,  role  of  chromatin  structure  and  histone  modifications  in  determining 
translocation  break  points. 

x.6  Summary 

It  has  become  increasingly  apparent  that  the  genome  is  non-randomly  organized  in 
interphase  nuclei  and  that  these  positioning  patterns  generally  correlate  with  nuclear 
function.  Several  recently  developed  technologies  are  driving  forward  our  understanding 
of  the  extent  and  relevance  of  spatial  genome  organization.  Combining  genome-wide 
strategies  and  high-throughput  siRNA  screens  with  FISH  will  enable  the  identification  of 
factors  that  directly  regulate  the  position  of  genomic  loci,  the  mechanisms  of  gene 
motion  and  will  give  a  clearer  understanding  of  the  functional  consequences  of 
positioning  patterns.  In  these  studies  it  will  be  important  to  consider  a  given  gene  in  the 
context  of  its  neighbourhood.  Comparisons  of  genome-wide  data  sets  merging  genome 
positioning  information  with  gene  expression,  epigenetics,  proteome,  non-coding  RNA 
expression  and  localisation,  in  multiple  biological  systems  (different  species,  cell  types, 
conditions,  diseases)  will  be  an  important  first  step.  Furthermore,  studying  the  genome 
in  live  cells  will  continue  to  give  important  insights  and  allow  the  hypotheses  generated 
from  fixed  cells  to  be  tested  in  real  time.  New  approaches,  such  as  clustered  regularly 
interspaced  short  palindromic  repeats  (CRISPR)  (Chen  et  al.  2013),  are  adding  to  the 
arsenal  of  techniques  for  live  cell  imaging  of  specific  regions  of  the  genome,  be  it 
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endogenous  loci  or  engineered  arrays.  The  time  is  ripe  to  integrate  data  from  of  the  new 
and  old  techniques  to  further  elucidate  important  properties  of  the  spatial  organization  of 
genomes. 
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Fig.  x.l.  Visualizing  genomic  loci.  (A)  Individual  chromosomes  form  discrete 
chromosome  territories  (CTs)  in  interphase  cells.  CTs  can  be  visualized  using 
fluorescence  in  situ  hybridization  (FISH),  in  this  case,  HSA11  is  shown  in  a  MCF10A 
breast  epithelial  cell  nucleus.  (B)  Individual  genes  assume  preferred  spatial  locations. 
FISH  reveals  the  location  of  MMP1  and  TGFB3  gene  loci  in  MCF10A  nuclei.  DAPI  to 
stain  DNA  in  blue. 
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Gene-poor  CTs  Gene-rich  CTs 


Fig.  x.2:  The  genome  is  non-randomly  organized  in  interphase  nucleus.  CTs 

occupy  preferred  positions  with  the  nucleus  both  relative  to  the  periphery  of  the  nucleus 
and  to  other  chromosomes.  Smaller  chromosomes  tend  to  be  more  internally  located. 
Gene-poor  chromosomes  have  a  preference  for  the  nuclear  periphery  and  gene-rich 
chromosomes  for  the  nuclear  interior,  and  chromosomes  with  intermediate  gene 
densities  are  intermediately  positioned.  The  nuclear  lamina  (NL)  is  associated  with 
inactive  chromatin,  whereas  the  nuclear  pore  complex  (NPC)  can  associate  with  active 
and  inactive  chromatin.  Even  within  a  CT,  gene-rich  and  gene  poor-chromatin  regions 
tend  to  be  spatially  separated,  with  genic  regions  tending  to  favor  both  the  edge  of  the 
CT  and  the  part  of  the  CT  which  faces  the  nuclear  interior. 
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Fig.  x.3.  Spatial  positioning  patterns  influence  translocation  frequency. 

T ranslocations  can  arise  from  DSBs  induced  by  cellular  stress  in  the  forms  of  genotoxic, 
oxidative,  replicative,  or  transcriptional  stress.  Illegitimate  joining  of  DSBs  can  result  in 
the  formation  of  translocations.  Proximal  chromosomes  A  and  B  translocate  at  higher 
frequency  than  distal  chromosomes  A  and  C,  or  B  and  C.  Chromatin  features  may  also 
predispose  certain  genomic  regions  to  translocation. 
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25  Abstract 

26 

27  In  eukaryotic  cells  the  genome  is  highly  spatially  organized.  Functional  relevance  of  higher  order 

28  genome  organization  is  implied  by  the  fact  that  specific  genes,  and  even  whole  chromosomes,  alter 

29  spatial  position  in  concert  with  functional  changes  within  the  nucleus,  for  example  with 

30  modifications  to  chromatin  or  transcription.  The  exact  molecular  pathways  that  regulate  spatial 

31  genome  organization  and  the  full  implication  to  the  cell  of  such  an  organization  remain  to  be 

32  determined.  However,  there  is  a  growing  realization  that  the  spatial  organization  of  the  genome  can 

33  be  used  as  a  marker  of  disease.  W  hile  global  genome  organization  patterns  remain  largely  conserved 

34  in  disease,  some  genes  and  chromosomes  occupy  distinct  nuclear  positions  in  diseased  cells 

35  compared  to  their  normal  counterparts,  with  the  patterns  of  reorganization  differing  between  diseases. 

36  Importantly,  mapping  the  spatial  positioning  patterns  of  specific  genomic  loci  can  distinguish 

37  cancerous  tissue  from  benign  with  high  accuracy.  Genome  positioning  is  an  attractive  novel 

38  biomarker  since  additional  quantitative  biomarkers  are  urgently  required  in  many  cancer  types. 

39  Current  diagnostic  techniques  are  often  subjective  and  generally  lack  the  ability  to  identify  aggressive 

40  cancer  from  indolent,  which  can  lead  to  over-  or  under-treatment  of  patients.  Proof-of-principle  for 

41  the  use  of  genome  positioning  as  a  diagnostic  tool  has  been  provided  based  on  small  scale 

42  retrospective  studies.  Future  large-scale  studies  are  required  to  assess  the  feasibility  of  bringing 

43  spatial  genome  organization- based  diagnostics  to  the  clinical  setting  and  to  determine  if  the 

44  positioning  patterns  of  specific  loci  can  be  useful  biomarkers  for  cancer  prognosis.  Since  spatial 

45  reorganization  of  the  genome  has  been  identified  in  multiple  human  diseases,  it  is  likely  that  spatial 

46  genome  positioning  patterns  as  a  diagnostic  biomarker  may  be  applied  to  many  diseases. 

47 
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48  A  shortage  of  quantitative  cancer  biomarkers 

49 

50  Cancer  is  a  major  public  health  issue.  In  2012  alone,  across  the  globe  approximately  14.1  million 

51  people  were  diagnosed  with  cancer  and  8.2  million  people  lost  their  lives  to  it  (Torre  et  al 2015).  In 

52  developed  countries  a  third  to  a  half  of  the  population  will  be  diagnosed  with  cancer  during  their 

53  lifetime  (Ahmad  et  al.,  2015;Siegel  et  al.,  2016).  Not  only  is  cancer  a  leading  cause  of  morbidity,  it  is 

54  also  a  leading  cause  of  mortality.  In  fact,  only  heart  disease  accounts  for  more  deaths  than  cancer; 

55  although  cancer  is  the  most  frequent  cause  of  death  for  40-79  year  olds  (Siegel  et  al .,  2016).  Cancer  is 

56  a  heterogeneous  collection  of  diseases.  The  most  prevalent  cancers  are  lung,  breast,  prostate  and 

57  colorectal  carcinomas,  which,  when  combined  account  for  -43%  of  all  new  cancer  diagnoses  (Torre 

58  et  al.,  2015;Siegel  et  al.,  2016).  These  four  cancers  are  also  leading  causes  of  cancer- related 

59  mortality,  contributing  -40%  of  the  total  (Torre  et  al .,  2015;Siegel  et  al .,  2016).  Even  cancers  derived 

60  from  the  same  organ  can  be  quite  heterogeneous  from  each  other,  and  are  further  stratified  based  on 

61  morphological  patterns,  clinical  features  and/or  molecular  characteristics.  This  sub-grouping  is 

62  important  clinically  because  the  appropriate  treatment  and  the  risk  of  recurrence  or  lethality  differ 

63  between  subgroups.  For  example,  HER2  (also  known  as  ERBB2)  positive  breast  cancers  have  a 

64  poorer  prognostic  outlook  than  luminal  breast  cancers  (Sorlie  et  al.,  2001;Ferte  et  al.,  2010),  and 

65  many  HER  2-positive  breast  cancers  respond  well  to  trastuzumab  (Herceptin)  treatment,  while 

66  endocrine  therapy  is  an  effective  treatment  option  for  the  majority  of  luminal  breast  cancers  (Ferte  et 

67  al.,  2010;Schnitt,  2010). 

68 

69  A  key  to  the  successful  treatment  of  any  disease  is  accurate  diagnosis,  preferably  at  an  early  stage  in 

70  disease  progression  when  treatments  are  generally  more  effective  and  less  aggressive.  However,  early 

71  detection  of  cancer,  particularly  by  population- based  cancer  screenings  such  as  mammography  or 

72  serum  prostate-specific  antigen  (PSA)  levels,  often  comes  at  the  cost  of  overdiagnosis  (Welch  and 

73  Black,  2010;B leyer  and  Welch,  2012;Sandhu  and  Andriole,  2012).  Overdiagnosis  is  the  detection  of 

74  indolent  cancers  that  will  remain  asymptomatic  during  the  patient's  lifetime  and,  as  such,  do  not 

75  require  treatment.  M  ost  commonly,  cancer  diagnosis  is  made  following  histological  examination  of 

76  biopsy  or  surgical  tissue  specimens.  Unfortunately,  in  most  instances  there  is  a  limit  to  the  prognostic 

77  information  that  can  be  gleaned  from  histological  analysis,  and  the  pathologist  is  unable  distinguish 

78  indolent  from  aggressive  cancers  with  a  high  degree  of  certainty.  Presently,  due  to  a  lack  of 

79  additional  markers  to  identify  aggressive  cancer,  many  overdiagnosed  patients  are  being  overtreated 

80  since  they  are  receiving  unnecessary  therapies.  Not  only  are  these  patients  receiving  no  therapeutic 

81  benefit,  as  their  cancer  would  never  have  significantly  progressed,  they  are  actually  being  harmed  by 

82  the  cancer  treatment  itself,  because  treatments  can  have  both  short-term  and  long-term  effects  that 

83  cause  illness,  reduce  quality  of  life,  create  large  financial  burdens  (for  example  by  loss  of  income), 

84  and  can  even  lead  to  death  (Cooperberg  et  al.,  2010;Welch  and  Black,  2010;B leyer  and  Welch, 

85  2012;Sandhu  and  Andriole,  2012).  It  has  become  a  major  focus  of  research  to  find  markers  that  can 

86  distinguish  between  indolent  and  aggressive  cancers,  to  aid  the  determination  of  the  best  possible 

87  treatment  plan  for  the  patient. 

88 

89  It  is  currently  difficult  to  accurately  assess  how  many  cancers  are  overdiagnosed  as  directly 

90  measuring  overdiagnosis  would  involve  monitoring  the  progress  of  the  cancer  and  the  cause  of  death 

91  of  cancer  patients,  without  treating  the  individuals.  The  reported  rate  of  overdiagnosis  is  also 

92  dependent  on  the  precise  definition  of  "clinically  relevant"  cancer.  Nevertheless,  for  breast  and 

93  prostate  cancer,  estimates  suggest  overdiagnosis  is  in  the  range  of  22-67%  of  cancers  (Draisma  et  al., 

94  2009;Welch  and  Black,  2010;B leyer  and  Welch,  2012;Sandhu  and  Andriole,  2012;M  i I ler  et  al., 

95  2014).  Using  prostate  cancer  mortality  as  an  endpoint,  as  many  as  84%  of  PSA  screen-detected 


Spatial  Genome  Organization  based  Diagnostics 


96  prostate  cancers  may  be  overdiagnosed  (McGregor  et  al 1998).  Breast  and  prostate  cancers  are 

97  highly  prevalent,  with  almost  1.7  million  women  and  1.1  million  men  worldwide  diagnosed  with 

98  these  cancers,  respectively,  in  2012  (Torre  et  al.,  2015).  In  the  USA,  where  population-based 

99  screening  for  breast  and  prostate  cancer  is  common,  over  240,000  women  and  more  than  180,000 

100  men  per  year  are  diagnosed,  respectively  (Siegel  et  al.,  2016).  Given  these  numbers,  even  taking  the 

101  low  estimate  of  overdiagnosis  of  1  in  5  cancers  additional  prognostic  markers  could  benefit  hundreds 

102  of  thousands  of  patients  every  year  and  reduce  health  care  costs  considerably. 

103 

104  Histological  assessment  of  tissue  specimens  can  be  beneficial  in  the  clinical  setting  to  reduce 

105  overtreatment.  For  example,  pathologists  classify  histological  patterns  within  prostate  cancer  tissue 

106  according  to  the  Gleason  grading  system,  which  broadly  correlates  with  the  differentiation  status  of 

107  the  cancerous  tissue  (M  ontironi  et  al.,  2005;E pstei n  et  al.,  2016).  Gleason  scores,  particularly  when 

108  used  in  combination  with  other  clinical  factors,  including  PSA  level  and  clinical  tumor  stage,  are  the 

109  strongest  markers  of  prostate  cancer  prognosis  and  progression  currently  available  (DAmico  et  al., 

110  1998;M  ontironi  et  al.,  2005;Cooperberg  et  al.,  2009;Chang  et  al.,  2014;E pstei n  et  al.,  2016).  Part  of 

111  the  strength  of  the  Gleason  scoring  system  is  that  it  takes  into  account  intra-individual  tumor 

112  heterogeneity  (Van  der  Kwast,  2014;E pstei n  et  al.,  2016).  A  low  Gleason  score  (Gleason  score  6  or 

113  below)  is  highly  indicative  of  an  indolent  cancer,  and  often  monitoring  the  tumor  (watchful- 

114  waiting/active  surveillance)  is  recommend  in  place  of  treatment  (Thompson  et  al.,  2007).  However, 

115  additional  markers  are  required  as  both  under-  and  over-treatment  remain  an  issue  (Cooperberg  et  al., 

116  2010).  Additionally,  histological  analysis  of  tissues  is,  by  its  very  nature,  subjective,  which  can  lead 

117  to  inter-  and  intra-observer  variations.  T aking  the  example  of  prostate  cancer  again,  two  pathologists 

118  will  agree  on  the  Gleason  score  36-81%  of  the  time  (M  ontironi  et  al.,  2005).  Similarly,  the  same 

119  pathologist  will  derive  an  identical  Gleason  score  for  the  same  tissue  specimen  43-78  %  of  the  time 

120  (M  ontironi  et  al.,  2005).  M  ost  frequently,  the  difference  in  score  varies  by  a  single  Gleason  grade, 

121  suggesting  these  small  variations  would  make  little  difference  to  most  patients.  However,  for  men 

122  with  Gleason  score  6  or  7  cancers  this  could  mean  the  difference  between  receiving  treatment  or  not. 

123  The  addition  of  quantitative  diagnostic  makers,  to  be  used  alongside  histological  analysis,  would 

124  reduce  inter-  and  intra-observer  variations,  and  thus  reduce  overtreatment. 

125 

126  T  argeting  the  nucleus  in  the  search  for  clinically  relevant  biomarkers 

127 

128  Molecular  markers,  by  their  very  nature,  tend  to  be  highly  quantifiable,  and  are  a  good  potential 

129  source  of  diagnostic  and  prognostic  cancer  biomarkers  (Ferte  et  al.,  2010).  However,  few  molecular 

130  markers  are  currently  in  routine  clinical  practice  (Ferte  et  al.,  2010).  An  exciting  area  for  exploration 

131  to  identify  novel  biomarkers  is  cancer-related  changes  in  the  architecture  of  the  cell  nucleus  (Veltri 

132  and  Christudass,  2014).  This  is  not  without  precedence  and  changes  in  nuclear  structure,  such  as 

133  nuclear  size,  shape,  prominence  of  nucleoli  and  chromatin  texture,  have  long  been  used  by 

134  pathologists  as  part  of  their  criteria  to  identify  cancerous  cells  (Zink  et  al.,  2004b;V eltri  and 

135  Christudass,  2014).  Moreover,  identifying  chromosomal  translocations  is  important  for  diagnosing 

136  hematological  cancers,  as  specific  translocations  characterize  the  type  of  leukemia  or  lymphoma,  are 

137  associated  with  distinct  clinical  features  and  are  often  indicative  of  the  appropriate  treatment 

138  (M  itelman  et  al.,  2007).  For  example,  chronic  myeloid  leukemia  is  diagnosed  after  the  cytogenetic 

139  detection  of  t(9;22)(q34;qll),  the  Philadelphia  chromosome,  which  results  in  a  BCR-ABL  fusion 

140  protein.  Tyrosine  kinase  inhibitors  (I matinib/G leevec)  inhibit  the  activity  of  the  resultant  oncogenic 

141  fusion  protein  and  use  of  this  targeted  therapy  results  in  exceptionally  high  rates  of  remission  for 

142  chronic  myeloid  leukemia  patients  (Hehlmann  et  al.,  2007).  Fluorescence  in  situ  hybridization 

143  (FISH),  a  technique  used  to  visualize  selected  sequences  of  DNA  within  interphase  nuclei  or  on 
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144  mitotic  chromosomes,  is  one  method  used  in  clinical  practice  to  detect  the  presence  of  specific 

145  translocations  (Muhlmann,  2002;Zink  et  al 2004b;Hehlmann  et  al 2007).  FISH  is  also  used 

146  clinically  to  detect  other  chromosomal  aberrations  in  solid  and  hematological  cancers,  such  as 

147  amplifications  of  the  ERBB2  locus  in  breast  cancer,  to  aid  diagnosis  or  as  a  prognostic  marker 

148  (Muhlmann,  2002;Lambros  et  al.,  2007;H asti ngs,  2010).  Changes  in  gene  expression  profiles  can 

149  also  be  useful  diagnostically.  For  instance,  there  are  several  commercially  available  assays  with 

150  prognostic  value  for  various  sub-types  of  breast  cancer  based  on  the  gene  expression  profiles  of 

151  between  2  and  97  genes  (Dai  et  al.,  2015). 

152 

153  Beyond  gene  expression  changes  and  genomic  aberrations,  there  are  multiple  other  aspects  of  nuclear 

154  structure  and  function  that  are  deregulated  in  cancer  and  could  be  exploited  clinically.  For  example, 

155  alteration  in  nuclear  shape,  size,  global  levels  and  patterns  of  heterochromatin  and/or  histone 

156  modifications  during  carcinogenesis  have  been  shown  to  be  predictive  of  cancer  progression  (Zink  et 

157  al.,  2004b;N ielsen  et  al.,  2008;V el tri  and  Christudass,  2014).  Aberrant  expression  of  A-  and/or  B- 

158  type  I  ami  ns  is  a  common  feature  of  many  types  of  cancer,  including  lung,  breast,  prostate,  colorectal, 

159  skin  and  gut  carcinomas  (Broers  et  al.,  1993;Moss  et  al.,  1999;Venables  et  al.,  2001;W  i  1 1  is  et  al., 

160  2008;B elt  et  al.,  2011;Kong  et  al.,  2012;W azi r  et  al.,  2013;Broers  and  Ramaekers,  2014;Saarinen  et 

161  al.,  2015).  The  majority  of  lamin  proteins  form  the  nuclear  lamina,  which  underlies  the  nuclear 

162  envelope  (NE),  and  an  additional  pool  of  intranuclear  lamins  exists  (Dittmer  and  Misteli,  2011). 

163  Although  highly  variable  between  individual  cancers  and  cancer  sub-types,  a  broad  generalization 

164  suggests  cancers  with  lower  expression  levels  of  A-type  lamins  tend  to  have  poorer  outcomes  and 

165  more  aggressive  phenotypes  (Belt  et  al.,  2011;Wazir  et  al.,  2013;Saarinen  et  al.,  2015).  This  is  in  line 

166  with  findings  that  cells  with  a  deficiency  in  A-type  lamins  can  migrate  more  easily  though  narrow 

167  constrictions  (Davidson  et  al.,  2014),  which  may  aid  metastatic  progression,  and  that  reduced  lamin 

168  A  has  been  linked  to  a  higher  cellular  proliferation  rate  (Venables  et  al.,  2001).  However,  the 

169  associations  between  reduced  lamin  expression  and  prognosis  tend  to  be  weak  or  based  on  a  low 

170  number  of  patients,  and  there  are  conflicting  studies  that  find  higher  levels  of  A-type  lamin 

171  expression  to  be  predictive  of  a  poor  outcome  (W  i  1 1  is  et  al.,  2008;Kong  et  al . ,  2012).  Since  difference 

172  in  lamin  expression  can  vary  between  sub-groups  of  cancers,  for  example  between  small  cell  lung 

173  cancer  and  non-small  cell  lung  cancer  (Broers  et  al.,  1993),  future  studies  with  more  defined  cohorts 

174  of  patients  may  prove  enlightening.  It  is  not  only  changes  in  expression,  mis-localization  of  lamins  to 

175  the  cytoplasm  (Broers  et  al.,  1993;M  oss  et  al.,  1999),  or  the  nucleolus  (Venables  et  al.,  2001)  have 

176  also  been  observed  in  some  cancer  specimens.  The  reduction  and/or  loss  of  lamin  isoforms  may  be  an 

177  early  event  in  carcinogenesis  as,  at  least  in  cervical  cancer,  it  can  occur  before  neoplasia  is  detectable 

178  (Capo-Chichi  et  al.,  2016).  Other  nuclear  bodies,  including  the  nucleolus  and  PM  L  bodies,  are  also 

179  disrupted  in  cancer,  with  the  specific  aberration  detected  depending  on  the  type  of  cancer  (Zink  et  al., 

180  2004b). 

181 

182  While  loss  of  genomic  integrity,  such  as  mutations,  amplifications,  deletions  and  translocations,  are 

183  hallmarks  of  cancer  and  cancer  progression  (M  itelman  et  al.,  2007;Hanahan  and  Weinberg,  2011),  an 

184  additional  feature  of  the  genome  is  emerging  as  a  potential  cancer  biomarker  -  namely  the  spatial 

185  organization  of  the  genome  within  the  interphase  nucleus. 

186 

187  Non-random  spatial  organization  of  the  genome  in  interphase  nuclei 

188 

189  The  genome  is  highly  spatially  organized  within  the  nucleus,  with,  for  the  most  part,  each  gene  and 

190  chromosome  occupying  preferred  nuclear  positions  (Cremer  and  Cremer,  2001;Ferrai  et  al., 

191  2010;Dekker  and  Misteli,  2015;Meaburn  et  al.,  2016a).  Although  the  spatial  organization  of  the 
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192  genome  is  non-random  it  does  not  mean  that  each  cell  within  a  population  has  an  identical 

193  positioning  pattern.  In  fact,  spatial  genome  organization  is  probabilistic  in  nature  with  considerable 

194  cel l-to-cel I  variability,  and  while  in  the  majority  of  cells  a  certain  position  (e.g.  internal,  proximal  to  a 

195  nuclear  speckle,  etc.)  will  be  favored,  within  the  population  of  cells  a  specific  genomic  region  can  be 

196  found  in  any  position  within  the  cell  (Parada  et  al .,  2003;B olzer  et  al .,  2005;Goetze  et  al ., 

197  2007;Dekker  and  M  isteli,  2015;K ind  et  al.,  2015). 

198 

199  The  nonrandom  spatial  positioning  of  genomic  loci  can  be  characterized  in  several  ways,  the  most 

200  common  being  radial  positioning  and  relative  positioning  (Figure  1).  Radial  positioning  describes  the 

201  position  of  a  locus  along  a  central  axis  between  the  nuclear  periphery  and  the  center  of  a  nucleus.  For 

202  instance,  in  many  cell  types  human  chromosome  18  preferentially  locates  to  the  nuclear  periphery, 

203  and,  conversely,  chromosome  19  is  positioned  centrally  (Croft  et  al.,  1999;B oyle  et  al.,  2001;Cremer 

204  et  al.,  2003).  On  the  other  hand,  relative  positioning  measures  the  position  of  a  gene  or  chromosome 

205  relative  to  another  nuclear  landmark,  such  to  another  gene  or  chromosome,  association  with  the  NE 

206  or  with  a  nuclear  body,  including  a  transcription  factory  or  a  splicing  speckle,  or  the  position  of  a 

207  gene  with  respect  to  the  rest  of  the  chromosome  it  is  located  on.  Relative  positioning  can  measure 

208  either  the  frequency  of  close  spatial  proximity  or  the  distance  between  the  two  objects.  As  an 

209  example  of  relative  positioning,  in  human  intermediate  erythroblasts  cells  the  ±-globin  gene  locus 

210  frequently  loops  out  of  its  chromosome  territory,  and  is  often  in  close  spatial  proximity  to  the  2- 

211  globin  gene  at  splicing  factor- enriched  nuclear  speckles  (Brown  et  al.,  2006).  T raditionally  radial  and 

212  proximal  positioning  are  measured  after  FISH.  More  recently,  biochemical  methods  have  been 

213  developed  to  map  the  proximity  of  a  locus  to  either  other  genomic  regions  (a  range  of  chromatin 

214  conformation  capture  techniques,  e.g,  3C,  4C,  5C,  HiC)  or  to  nuclear  structures  (e.g.  DamID,  ChlA- 

215  PET)  (van  Steensel  and  Dekker,  2010;Bickmore  and  van  Steensel,  2013;Dekker  and  M  isteli,  2015). 

216  These  traditional  and  new  methods  are  highly  complementary,  and  it  is  their  combined  use  that  will 

217  drive  forward  our  understanding  of  genome  organization.  Unlike  FISH,  the  biochemical  methods  are 

218  population- based  and  the  heterogeneity  inherent  in  positioning,  either  between  individual  cells  or 

219  between  the  two  alleles  of  a  gene  within  a  single  cell,  is  not  captured.  Conversely,  as  opposed  to 

220  many  of  the  biochemical  methods,  FISH  is  relatively  low-throughput  and  candidate  genes  are 

221  required  for  analysis. 

222 

223  C  orrelations  between  spatial  genome  organization  and  transcription 

224 

225  The  exact  molecular  mechanisms  that  regulate  the  spatial  organization  of  the  genome  are  currently 

226  not  well  elucidated.  M  oreover,  it  is  not  clear  if  the  spatial  organization  of  the  genome  is  important  to 

227  regulate  proper  nuclear  function  or  if  it  is  simply  a  byproduct  of  nuclear  processes.  Part  of  the 

228  difficulty  in  resolving  this  question  is  that  it  is  virtually  impossible  to  completely  separate  the  various 

229  nuclear  functions  from  each  other  and,  therefore,  determine  cause  and  effect.  It  is  also  likely  that 

230  different  genes  are  regulated  by  different  mechanisms;  for  every  rule  suggested  to  govern  spatial 

231  genome  organization  there  are  multiple  examples  of  exceptions.  That  said,  there  are  several  lines  of 

232  evidence  that  suggest  functional  importance  to  spatial  genome  organization.  The  first  hint  comes 

233  from  the  fact  that  gene-rich  regions  of  the  genome  are  spatially  separated  from  gene-poor  regions 

234  (Boyle  et  al.,  2001;Boutanaev  et  al.,  2005;Clemson  et  al.,  2006;Shopland  et  al.,  2006;Si monis  et  al., 

235  2006;Lieberman-Aiden  et  al.,  2009;B oyle  et  al.,  2011).  For  instance,  gene-rich  chromosomes  are 

236  generally  internally  located  whereas  gene-poor  chromosomes  are  enriched  at  the  nuclear  periphery 

237  (Boyle  et  al.,  2001)  and  within  individual  chromosomes  gene-poor  and  gene-rich  chromatin  are 

238  spatially  separate,  even  at  the  M  b  scale  (Boutanaev  et  al.,  2005;Shopland  et  al.,  2006;Lieberman- 

239  Aiden  et  al.,  2009).  This  suggests  that  the  spatial  organization  of  the  genome  is  functionally 
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240  important  for  gene  expression,  or  is  at  least  dictated  by  gene  expression.  This  hypothesis  is  supported 

241  by  the  observation  of  spatial  clustering  of  some  co-regulated  genes,  e.g.  tRNA  genes  (Thompson  et 

242  al .,  2003),  K LFl-regulated  genes  (Schoenfelder  et  al .,  2010),  STAT-regulated  genes  (Hakim  et  al ., 

243  2013)  or  olfactory  receptor  genes  (Clowney  et  al.,  2012).  Further,  genomic  regions  that  are  gene-poor 

244  and/or  contain  mostly  inactive  genes  tend  to  cluster  together,  for  example  at  the  nuclear  lamina, 

245  around  the  edge  of  the  nucleolus  or  with  peri-centromeric  heterochromatin  (Pickersgill  et  al., 

246  2006;Guelen  et  al.,  2008;Nemeth  et  al.,  2010;Peric-H upkes  et  al.,  2010;van  Koningsbruggen  et  al., 

247  2010;Kind  et  al.,  2013;W ijchers  et  al.,  2015).  Moreover,  the  spatial  organization  of  the  genome  is 

248  different  in  cells  with  differing  gene  expression  profiles,  for  example  the  genome  reorganizes  during 

249  differentiation  and  is  different  between  different  cell-  and  tissue-types  (Kosak  et  al.,  2002;Parada  et 

250  al.,  2004;Solovei  et  al.,  2009;Peric-H  upkes  et  al.,  2010;Foster  et  al.,  2012;Hakim  et  al.,  2013;Rao  et 

251  al.,  2014;Meaburn  et  al.,  2016b).  While  specific  loci  often  change  nuclear  positions,  there  is  not  a 

252  global  spatial  rearrangement  of  the  genome  between  cell  types.  Positioning  patterns  are  remarkably 

253  conserved  between  cell  types  and  stages  of  differentiation,  with  only  specific  genomic  regions  in 

254  alternative  positions.  For  instance,  in  human  fibroblasts  and  lymphocytes  most  chromosomes  are  in 

255  similar  radial  positions  and  only  the  position  of  chromosomes  8,  20  and  21  differ  (Boyle  et  al., 

256  2001;M  eaburn  et  al.,  2008).  Similarly,  of  the  genomic  regions  that  associated  with  the  nuclear  lamina 

257  in  mouse  embryonic  stem  cells  and  astrocytes,  approximately  20%  are  lamina-associated  in  only  one 

258  of  the  two  cell  types  (Peric-H  upkes  et  al.,  2010). 

259 

260  M  ost  saliently,  genes  can  relocate  to  different  nuclear  locations  with  changes  in  their  transcriptional 

261  output  (Fraser  and  Bickmore,  2007;Ferrai  et  al.,  2010).  For  example,  active  Gfap  and  114  alleles  are 

262  generally  more  internally  located  than  their  silent  counterparts  (Takizawa  et  al.,  2008a)  and  CFTR, 

263  Mashl  and  IgH  display  lower  association  with  the  nuclear  periphery  upon  activation  (Kosak  et  al., 

264  2002;Zink  et  al.,  2004a;W illiams  et  al.,  2006).  Conversely,  genes,  such  as  //VO I,  HXK1  and  GAL1 

265  move  to  the  nuclear  pore  complex  (NPC),  at  the  periphery  of  the  nucleus,  upon  activation  (Brickner 

266  and  Walter,  2004;T addei  et  al.,  2006;B rickner  et  al.,  2007)  and  some  highly  expressed  gene-rich 

267  regions  of  the  genome,  including  major  histocompatibility  complex,  epidermal  differentiation 

268  complex  and  HoxB  loci  loop  out  of  the  bulk  of  their  chromosome  territory  when  active  (Volpi  et  al ., 

269  2000;W illiams  et  al.,  2002;Chambeyron  and  Bickmore,  2004).  Not  all  loci  alter  nuclear  location  at 

270  the  exact  time  of  changes  in  expression.  Some  loci  reposition  before  changes  in  gene  expression, 

271  adopting  poised  positions  in  the  days  leading  up  to  developmental ly  regulated  activation  of 

272  expression  (Ragoczy  et  al.,  2003;Chambeyron  and  Bickmore,  2004;Peric-H upkes  et  al.,  2010), 

273  suggesting,  at  least  for  some  regions,  changes  in  nuclear  positions  are  not  simply  a  consequence  of 

274  the  activation  of  gene  expression.  On  the  other  hand,  some  genes  move  only  after  changes  in 

275  expression,  for  example,  during  erythroid  differentiation  the  mouse 2  -globin  gene  is  activated  prior  to 

276  relocation  away  from  the  nuclear  periphery  (Ragoczy  et  al.,  2006).  In  this  case  the  movement  of  the 

277  locus  might  help  modify  the  magnitude  of  transcription  since  expression  is  further  increased  with  the 

278  repositioning  (Ragoczy  et  al.,  2006).  It  is  not  only  upon  activation  genes  alter  nuclear  location. 

279  Relocation  to  blocks  of  heterochromatin  is  associated  with  gene  repression  for  some  genes  (Brown  et 

280  al.,  1997;Brown  et  al.,  1999;F rancastel  et  al.,  1999;W ijchers  et  al.,  2015);  however,  this  is  not  a 

281  general  rule  as  other  silenced  gene  loci  do  not  associate  with  blocks  of  heterochromatin  (M  oen  et  al., 

282  2004;Takizawaetal.,  2008a). 

283 

284  L  ack  of  correlations  between  spatial  genome  organization  and  transcription 

285 

286  In  addition  to  the  discrepancy  amongst  genes  on  the  timing  of  spatial  repositioning  in  relation  to 

287  altered  transcription,  there  are  a  large  number  of  studies  demonstrating  that  for  many  genes  changes 
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288  in  nuclear  position  and  gene  expression  are  uncoupled  (Scheuermann  et  al 2004;W i I liams  et  al 

289  2006;Kumaran  and  Spector,  2008;Meaburn  and  Misteli,  2008;Takizawa  et  al.,  2008b;Morey  et  al., 

290  2009;Harewood  et  al.,  2010;Hakim  et  al.,  2011;Shachar  et  al.,  2015).  For  instance,  when  a  breast 

291  epithelial  cell  line,  MCF10A,  was  grown  under  four  different  growth  conditions  there  was  no 

292  correlation  between  change  in  gene  expression  and  change  radial  positioning  pattern  for  any  of  the 

293  eleven  genes  analyzed  (M  eaburn  and  M  isteli,  2008).  There  could  be  several  reasons  to  explain  this. 

294  Firstly,  radial  positioning  is  not  an  ideal  measure  to  assess  function  (Takizawa  et  al.,  2008b).  With 

295  the  exception  of  the  NE,  all  radial  locations  within  the  nucleus  are  approximately  equal  with  respect 

296  to  function  and  gene  expression.  Transcription  factories  and  active  genes  are  equally  distributed 

297  throughout  the  nucleus  (Wansink  et  al.,  1993;T akizawa  et  al.,  2008a;Paz  et  al.,  2015).  Thus,  changes 

298  in  radial  position  do  not  necessary  represent  biologically  relevant  changes,  as  they  do  not  correlate 

299  with  moving  a  locus  into  a  more  or  less  transcriptionally  active  environment.  Moreover,  radial 

300  positioning  can  miss  biologically  relevant  changes  to  positioning  if  it  does  not  shift  the  locus  to  a 

301  different  radial  zone.  Proximity  to  and  association  with  nuclear  bodies  is  likely  to  be  far  more 

302  important  for  gene  expression  and  RNA  processing  than  radial  positioning.  Indeed,  for  many  genes 

303  changes  in  gene  expression  accompany  changes  in  association  with  nuclear  sub-compartments, 

304  including  the  lamina,  nucleolus,  transcription  factories,  heterochromatin,  polycomb  bodies,  NPCs, 

305  Cajal  bodies  and  nuclear  speckles  (Ahmed  et  al.,  2010;Ferrai  et  al .,  2010;Bickmore  and  van  Steensel, 

306  2013;Khanna  et  al.,  2014;Dekker  and  M  isteli,  2015;Wang  et  al.,  2016).  Furthermore,  transient  co- 

307  localization  of  X  chromosomes  during  the  establishment  of  X  inactivation  and  co-localization  of 

308  imprinted  genes  with  imprinting  control  region  loci  suggest  a  critical  role  for  spatial  genome 

309  organization  in  establishing  and  maintaining  mono-allelic  expression  in  these  regions  (Fraser  and 

310  Bickmore,  2007;M  acDonald  et  al.,  2016).  Flowever,  radial  positioning  vs  relative  positioning  cannot 

311  be  the  full  story  in  explaining  the  differences  between  the  studies  because  some  genes  spatially 

312  reposition  in  the  absence  of  a  change  in  expression.  For  example,  Mashl  relocates  to  a  more  internal 

313  nuclear  position  upon  activation  during  neuronal  differentiation,  however,  the  repositioning  is  not 

314  specific  to  Mashl  and  several  chromosomally  neighboring  genes  also  reposition  despite  remaining 

315  transcriptionally  silent  (Williams  et  al.,  2006).  This  is  not  limited  to  transcriptionally  silent  genes; 

316  active  genes  can  also  change  nuclear  positions  without  modulating  transcription.  As  an  example,  all 

317  four  genes  (AKT1,  BCL2,  ERBB2  and  VEGF)  identified  to  change  radial  position  after  in  vitro 

318  induction  of  early  breast  cancer  were  expressed  at  similar  levels  in  the  normal  as  compared  to  tumor 

319  cells  (M  eaburn  and  M  isteli,  2008). 

320 

321  It  could  be  that  some  genes  are  more  sensitive  to  their  nuclear  position  for  expression  than  others, 

322  and  the  genes  that  require  specific  positioning  patterns  for  their  correct  expression  drive  the 

323  relocation  of  neighboring  regions  by  "pulling"  them  along.  Consistent  with  this,  when  loci  are  forced 

324  into  a  different  nuclear  environment,  either  by  being  artificially  tethered  or  due  to  a  chromosomal 

325  translocation,  the  expression  of  some  but  not  all  genes  is  affected  (Finlan  et  al.,  2008;Kumaran  and 

326  Spector,  2008;Reddy  et  al .,  2008;Harewood  et  al .,  2010;W  ijchers  et  al .,  2015).  Additionally,  the  type 

327  of  expression  change  may  be  an  important  factor.  M  ost  of  the  genes  that  have  been  identified  to  alter 

328  nuclear  location  in  parallel  with  changes  in  gene  expression  are  genes  that  are  being  activated  or 

329  silenced  during  development  and  differentiation,  i.e.  are  genes  associated  with  a  more  or  less 

330  permanent  change  in  expression,  and  although  these  expression  changes  occur  in  a  coordinated 

331  manner,  it  does  not  need  to  be  rapid.  Thus,  it  could  be  that  a  specific  nuclear  environment  is  required 

332  for  this  activation/repression,  and  time  can  be  taken  for  a  locus  to  be  relocated  to  this  environment, 

333  but  once  a  gene  is  permanently  induced  or  silenced  the  environment  can  change  without  affecting  the 

334  expression  of  that  region.  In  support,  transient  homologous  pairing  of  Oct4  alleles  occurs  during 

335  mouse  ESC  differentiation  only  as  Oct4  becomes  permanently  repressed  (H ogan  et  al.,  2015).  It  is 
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336  also  likely  that  ongoing  transcription  is  not  required  to  maintain  spatial  positioning  patterns,  since 

337  long-range  DNA  interactions  of  active 2 -globin  and  Rad23a  genes  are  maintained  after  the  inhibition 

338  of  RNA  polymerase  II  transcription  (Palstra  etal.,  2008).  Precise  spatial  positioning,  especially  radial 

339  positioning,  may  not  have  a  significant  role  for  all  genes  beyond  switching  on/off.  For  genes  that  turn 

340  on  and  off  rapidly  in  response  to  changing  environmental  signals,  for  example  upon  stimulation  by  a 

341  hormone  or  stress  signals,  rapid  induction  may  be  enhanced  by  proximal  spatial  positioning  of  sets  of 

342  co-regulated  genes  prior  to  the  stimulation,  i.e.  in  a  permanently  poised  position.  In  keeping  with  this, 

343  glucocorticoid  receptor  responsive  genes  cluster  in  the  absence  of  glucocorticoid,  and  (re)stimulation 

344  with  glucocorticoid  does  not  significantly  change  positioning  patterns  (Hakim  et  al .,  2011).  Direct 

345  evidence  for  the  concept  of  a  more  rapid  activation  of  expression  for  genes  already  in  the  correct 

346  nuclear  environment  comes  from  yeast  studies.  The  NPC  associates  with  active  regions  of  the 

347  genome,  with  specific  genes  recruited  to  the  NPC  upon  activation  (Egecioglu  and  Brickner, 

348  2011;L iang  and  Hetzer,  2011).  Even  after  the  stimulation  has  been  removed  and  the  expression  from 

349  these  genes  has  been  repressed  several  genes,  including  HXK1,  GAL1  and  /A/O I,  remain  at  the 

350  nuclear  periphery  and  these  peripheral  loci  are  activated  more  quickly  upon  future  re-stimulation 

351  (Brickner  et  al.,  2007;Tan-Wong  etal.,  2009). 

352 

353  Additional  players  and  functions 

354 

355  Remodeling  of  chromatin  is  intimately  linked  to  gene  expression.  Thus,  it  is  possible  that  chromatin 

356  remodelers  and  the  resulting  alterations  to  histones  and  chromatin  compaction  are  an  important  part 

357  of  the  molecular  machinery  that  reorganizes  the  genome,  rather  than  gene  transcription  itself. 

358  Emerging  evidence  suggests  this  may  well  be  the  case.  Decondensation  of  chromatin  in  the  absence 

359  of  altered  gene  expression,  even  for  silent  genes,  is  sufficient  for  a  locus  to  spatially  reposition 

360  (Chambeyron  and  Bickmore,  2004;T herizols  et  al.,  2014).  Similarly,  modifications  to  histones  are 

361  important  for  anchoring  loci  to  the  nuclear  periphery,  independently  of  a  role  in  gene  repression  these 

362  modifications  are  associated  with  (Harr  et  al.,  2016).  For  example,  histone  acetyl  transferase  SAGA 

363  (Dultz  etal.,  2016),  histone  deacetyl  ases  (Zullo  etal.,  2012;Demmerle  etal.,  2  0 1 3 ;  D  u  I  tz  etal.,  2016) 

364  and  histone  H3K9  di-  and  tri -methyl  ati  on  by  histone  methyl  transferase  (Towbin  et  al.,  2012;B  ian  et 

365  al.,  2013;K ind  et  al .,  2013;H  arr  et  al .,  2015)  are  required  to  localize  at  least  some  genes  to  the  nuclear 

366  periphery.  It  is  most  likely  that  chromatin  remodelers  work  in  tandem  and  that  it  is  specific 

367  combination  of  histone  marks  that  are  required  to  target  a  genomic  region  to  the  nuclear  periphery 

368  (Harr  etal.,  2016).  It  is  also  likely  that  chromatin  remodelers  play  a  role  in  targeting  genes  within  the 

369  nucleoplasm,  away  from  the  NE,  since  chromatin  remodelers  EZH2  and  SUV39H1  have  been 

370  demonstrated  to  be  critical  for  local  chromatin  clustering  into  sub-domains  (Wijchers  et  al.,  2016) 

371  and  knocking  down  a  range  of  chromatin  remodelers  and  histone  modifiers  affected  the  position  of 

372  several  non-NE  associated  genes  (Shachar  etal.,  2015). 

373 

374  In  addition  to  gene  expression  and  chromatin  state,  many  other  nuclear  factors  have  been  linked  to  a 

375  role  in  correct  spatial  organization  of  the  genome,  these  include  actin,  myosin  (Chuang  et  al., 

376  2006;Dundr  et  al.,  2007;Mehta  et  al.,  2010),  NE  proteins,  include  A-type  lamins  (Meaburn  et  al., 

377  2007;M  ewborn  et  al.,  2010;M  ehta  et  al.,  2011;Kubben  et  al.,  2012;Solovei  et  al.,  2013;H  arr  et  al., 

378  2015),  B-type  lamins  (Malhas  et  al.,  2007;Ranade  et  al.,  2016),  lamin  B  receptor  (Clowney  et  al., 

379  2012;Solovei  et  al.,  2013),  emerin  (Demmerle  et  al.,  2013)  and  several  other  NE  transmembrane 

380  proteins  (Zuleger  et  al.,  2013),  NPC  proteins  (Ahmed  et  al.,  2010;D ultz  et  al.,  2016)  and  non-coding 

381  RNAs  (Hacisuleyman  et  al.,  2014). 

382 
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383  Interestingly,  different  genes  can  be  targeted  to  the  same  nuclear  compartment  via  different  protein 

384  complexes  and  pathways,  and  the  use  of  the  different  mechanisms  may  add  an  additional  layer  of 

385  control  over  the  speed  of  inducing  or  enhancing  gene  expression  (Randise-H inchliff  et  al .,  2016). 

386  Moreover,  different  genes  require  different  proteins  complexes  for  their  correct  nuclear  location. 

387  SiRNA  screening  of  669  nuclear  proteins,  combined  with  mapping  the  radial  position  of  3-4  genes  by 

388  FISH,  identified  50  proteins  with  roles  in  spatially  organizing  the  genome.  However,  of  these 

389  proteins,  only  knocking  down  one,  CACNG1,  repositioned  all  four  genes  studied.  M  oreover,  half  of 

390  the  hits  affected  the  position  of  only  a  single  gene  (Shachar  et  al.,  2015). 

391 

392  At  some  level  underlying  DNA  sequence  must  play  a  role  in  the  spatial  positioning  of  the  genome. 

393  Transcription  factors  bind  to  specific  sequences  in  the  genome,  thereby,  activating  a  specific  set  of 

394  genes,  and  genes  regulated  by  specific  transcription  factors  can  spatially  cluster  (Schoenfelder  et  al., 

395  2010;Hakim  et  al.,  2011; H  aki m  et  al.,  2013).  Some  specific  DNA  sequences  have  been  demonstrated 

396  to  have  a  direct  role  in  targeting  genes  to  specific  locations  within  the  nucleus.  In  S.  cereviside,  two 

397  short  gene  recruitment  sequences,  within  gene  promotors,  are  required  to  target  inducible  genes  to  the 

398  NPC,  allowing  an  enhanced  transcriptional  output  (Ahmed  et  al.,  2010).  It  has  recently  been 

399  proposed  that  the  association  between  repetitive  sequences,  in  particular  homotypic  associations 

400  between  short  interspersed  retrotransposable  elements  (SINEs),  which  are  enriched  in  gene-rich 

401  regions  of  the  genome,  or  long  interspersed  retrotransposable  elements  (LINEs),  which  are  enriched 

402  in  gene-poor  regions,  may  be  a  key  underling  factor  responsible  for  the  spatial  separation  of  gene- 

403  rich  and  gene-poor  regions  of  the  genome  (Cournac  et  al.,  2016;Solovei  et  al.,  2016).  However, 

404  spatial  positioning  patterns  cannot  be  an  intrinsic  feature  of  a  piece  of  DNA,  since,  as  discussed 

405  above,  genes  and  chromosomes  can  alter  position  without  changes  in  DNA  sequence,  e.g.  between 

406  cell  types  or  upon  activation.  Additionally,  some  human  chromosomes,  including  chromosome  X, 

407  adopt  different  nuclear  positions  when  they  are  in  a  mouse  nucleus  compared  to  when  they  are  within 

408  their  endogenous  human  cell  (Meaburn  et  al.,  2008).  It  is  also  unlikely  all  repeat  sequences  direct 

409  spatial  organization,  as  peri centromeric  satellite  repeats  migrate  to  existing  heterochromatin 

410  compartments  during  differentiation  rather  than  drive  the  aggregation  of  heterochromatin  (Wijchers 

411  et  al .,  2015). 

412 

413  Replication  has  also  been  linked  to  the  spatial  organization  of  the  genome.  Genomic  regions  change 

414  nuclear  location  as  they  change  replication  timing  during  development  and  disruptions  to  replication 

415  can  affect  the  radial  position  of  some  genes  (Hiratani  et  al.,  2008;Shachar  et  al.,  2015).  Similarly, 

416  mitosis  and  NE  reformation  in  early  Gi  are  important  for  some  loci  to  establish  nuclear  location, 

417  especially  for  gene-poor  and  inactive  regions  when  associating  with  the  NE  (Bridger  et  al., 

418  2000;Kumaran  and  Spector,  2008;K ind  et  al.,  2013)  or  during  muscle  differentiation  (Neems  et  al., 

419  2016).  However,  once  more,  these  cannot  be  general  mechanisms  for  all  genes.  Firstly,  several  genes 

420  and  chromosomes  reposition  as  cells  exit  the  cell  cycle  and  become  quiescent  or  senescent,  which 

421  clearly  occurs  in  the  absence  of  replication,  mitosis  and  NE  reformation  (Bridger  et  al .,  2000;Solovei 

422  et  al.,  2004;M  eaburn  et  al.,  2007;M  eaburn  and  M  isteli,  2008).  Secondly,  an  alternative  mechanism 

423  must  be  required  for  genes  that  move  rapidly  after  stimulation.  For  example,  relocation  to  the  NPC 

424  occurs  within  15-60  mins  of  activation  in  60%  of  nuclei  (Randise-H  inchliff  et  al.,  2016).  This  timing 

425  is  too  fast  for  replication,  mitosis  or  NE  reformation  to  play  a  major  role. 

426 

427  The  spatial  organization  of  the  genome  may  also  have  a  role  in  DNA  repair,  since  whole 

428  chromosomes  can  reposition  after  DNA  damage  (M  ehta  et  al.,  2013;loannou  et  al.,  2015),  however, 

429  given  that  in  mammalian  cells  broken  DNA  ends  do  not  significantly  move  (Kruhlak  et  al., 

430  2006;Soutoglou  et  al.,  2007;Roukos  et  al.,  2013),  the  function  of  this  repositioning  is  not  clear.  What 
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431  is  clear,  however,  is  the  role  the  spatial  organization  plays  in  the  partner  choice  when  oncogenic 

432  chromosomal  translocations  form,  after  the  DNA  breaks  have  occurred.  Translocations  preferentially 

433  form  between  loci  that  are  in  close  spatial  proximity  prior  to  damage  (Lukasova  et  al .,  1997;Neves  et 

434  al.,  1999;Roix  et  al.,  2003;Lin  et  al.,  2009;Mani  et  al.,  2009;Mathas  et  al.,  2009;C hiarle  et  al., 

435  2011;K lei n  et  al.,  2011;Hakim  et  al.,  2012;Roukos  et  al.,  2013).  In  a  similar  vein,  the  spatial 

436  organization  of  the  genome  also  helps  dictate  the  integration  sites  of  HIV-1  virus  DNA  into  the 

437  human  genome  after  infection.  HIV-1  integrates  preferentially  into  highly  transcribing  genes  that 

438  associated  with  the  N  PC  (Lelek  et  al.,  2015;M  arini  et  al.,  2015).  A  direct  functional  role  for  spatial 

439  genome  positioning  has  not  only  been  shown  for  disease  progression.  In  the  rod  cells  of  nocturnal 

440  mammals  a  redistribution  of  heterochromatin  to  the  center  of  the  nucleus  and  euchromatin  to  the 

441  nuclear  periphery  aids  night  vision  by  optimizing  the  diffraction  of  light  though  the  retina  (Solovei  et 

442  al.,  2009). 

443 

444  Thus,  while  trends  are  emerging  as  to  how  and  why  the  genome  is  spatially  organized,  no  "one  size 

445  fits  all  model"  has  yet  to  clearly  present  itself.  This  is  not  surprising  given  the  complexity  of  the 

446  nucleus.  Most  likely  it  will  be  a  combination  of  molecular  strategies  working  in  concert,  and 

447  differing  depending  on  the  gene  and  the  cellular  context,  that  optimizes  correct  nuclear  function  and 

448  organization. 

449 

450  Spatial  reorganization  of  the  genome  in  non-malignant  disease 

451 

452  Even  though  many  mechanistic  and  functional  aspects  of  the  spatial  organization  of  the  genome  have 

453  yet  to  be  resolved,  it  is  becoming  increasingly  evident  that  genomic  spatial  positioning  patterns  can 

454  be  used  as  markers  of  disease.  In  a  wide  variety  of  diseases,  including  cancer,  individual  genes, 

455  genomic  regions  and/or  whole  chromosomes  reposition  (Figure  2)  (Ferrai  et  al.,  2010;B ridger  et  al., 

456  2014;M  eaburn  et  al.,  2016a).  This  phenomenon  was  first  identified  in  the  1980s  in  brain  tissues  of 

457  epilepsy  patients  (Borden  and  Manuelidis,  1988).  The  centromere  of  chromosome  X  redistributes 

458  away  from  the  NE  or  edge  of  nucleoli  in  neurons  from  the  epileptic  seizure  focus  as  compared  to 

459  normal  neurons  from  unaffected  areas  of  the  brain  from  the  same  individuals  (Figure  2)  (Borden  and 

460  M  anuelidis,  1988).  This  repositioning  is  not  common  to  other  heterochromatin  regions,  as  neither  the 

461  peri centromeric  hetero chromatin  regions  on  chromosomes  1  (lql2)  and  9  (9ql2)  nor  the  large 

462  heterochromatin  block  mapping  to  Yql2  reposition  (Borden  and  Manuelidis,  1988).  Generally 

463  speaking,  different  loci  have  been  positioned  in  different  diseases,  with  very  few  loci  studied  in  more 

464  than  one  affliction,  making  it  unclear  how  general  the  disease- related  repositioning  of  a  given  locus 

465  is.  At  least  some  repositioning  events  are  different  between  different  diseases,  however.  W  hile  lql2 

466  is  not  repositioned  in  epilepsy,  in  the  lymphocytes  from  patients  with  endometriosis  lql2  is  more 

467  peripherally  located,  compared  to  control  individuals,  (Mikelsaar  et  al.,  2014)  and  more  internally 

468  located  in  Immunodeficiency,  Centromere  instability  and  Facial  anomalies  syndrome  (ICF)  patient 

469  lymphocytes  (Jefferson  et  al.,  2010;D  upont  et  al .,  2012).  Additional  genomic  regions  were  also  found 

470  to  reposition  in  ICF,  including  a  reduced  association  of  chromosome  16' s  peri  centromeric 

471  heterochromatin  region  with  the  N  E,  and  an  increased  incidence  of  human  pseudoautosomal  region  2 

472  genes  looping  out  of  their  chromosome  territories  (chromosome  Y  and  the  inactive  X)  (M  atarazzo  et 

473  al.,  2 007 Jefferson  et  al.,  2010).  Similarly  to  epilepsy,  loci-specific  repositioning  was  observed  in 

474  ICF,  with  other  regions  on  chromosome  1  ( BTG2 ,  CNN3,  ID3,  and  RGS1 )  and  the  peri  centromeric 

475  heterochromatin  of  chromosome  9  not  repositioning  in  ICF  (Jefferson  et  al.,  2010;Dupont  et  al., 

476  2012).  Peri  centromeric  loci  are  not  the  only  repeat  sequences  to  be  spatially  affected  in  disease. 

477  Aberrant  telomere  clustering  has  been  identified  in  several  diseases,  including  cancer  and 
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478  Alzheimer's,  often  due  to  telomere  dysfunction,  and  the  degree  of  clustering  can  reflect  the 

479  aggressiveness  of  the  disease  (Chuang  etal.,  2004;Gadji  etal.,  2010;M  athuretal.,  2014). 

480 

481  Loci-specific  repositioning  also  occurs  in  laminopathies  (Figure  2)  (M  eaburn  et  al .,  2007;Taimen  et 

482  al.,  2009;M  ewborn  et  al.,  2010;M  attout  et  al.,  2011;M  ehta  et  al.,  2011;M  cCord  et  al.,  2013),  a  wide 

483  ranging  group  of  diseases  characterized  by  mutations  in  lamin  A/C,  and  includes  Emery-Dreifuss 

484  muscular  dystrophy  and  the  premature  aging  disease  H utchi nson-G i Iford  progeria  syndrome  (HGPS) 

485  (Dittmer  and  Misteli,  2011).  The  organization  of  chromosomes  in  proliferating  laminopathy 

486  fibroblasts  cells  mimics  that  of  normal  quiescent  cells,  with  chromosomes  13  and  18  shifted  to  a 

487  more  internal  position,  chromosome  10  repositioning  towards  the  nuclear  periphery,  and 

488  chromosome  4  and  X  remaining  at  the  nuclear  periphery  (M  eaburn  et  al.,  2007;M  ewborn  et  al., 

489  2010;M  ehta  et  al.,  2011).  Generally,  the  spatial  reorganization  of  the  genome  very  similar  between 

490  different  laminopathy  patients,  regardless  of  the  disease  or  mutation  present  (M  eaburn  et  al., 

491  2007;M  ewborn  et  al.,  2010),  the  exception  being  that  chromosome  13  is  more  peripheral  in  patients 

492  with  "  303  or  D596N  LMNA  mutations  (M  ewborn  et  al.,  2010).  M  oreover,  the  spatial  organization  of 

493  the  genome  in  asymptomatic  laminopathy  carriers  is  very  similar  to  the  laminopathy  patients 

494  (M  eaburn  et  al.,  2007).  Taken  together,  this  suggests  that  repositioning  results  from  the  presence  of 

495  aberrant  A -type  lamin  rather  than  the  specific  disease  per  se.  The  release  of  chromosome  13  and  18 

496  from  the  nuclear  periphery  may  be  linked  to  changes  in  chromatin  status,  si  nee  there  is  a  reduction  of 

497  heterochromatin-specific  markers,  including  HP1±  and  histone  H3K9  methylation,  in  HGPS 

498  fibroblasts  (Scaffidi  and  M  isteli,  2005),  or  it  may  be  due  to  an  uncoupling  of  cell-cycle  regulation 

499  and  spatial  positioning  patterns  (M  eaburn  et  al.,  2007).  In  progeria  at  least,  the  redistribution  of  the 

500  genome  may  be  progressive.  An  abnormal  clustering  of  centromeres  associated  with  the  LMNA 

501  E145K  mutation  only  occurs  in  nuclei  with  aberrant  shapes,  and  the  prevalence  increases  with 

502  increased  time  in  culture  (Taimen  et  al.,  2009).  Compared  to  proliferating  normal  fibroblasts,  a 

503  dramatic  genome-wide  reduction  in  the  spatial  separation  of  active  and  inactive  chromatin  was 

504  observed  in  senescent  HGPS  fibroblasts,  but  not  in  proliferating  HGPS  cultures  (McCord  et  al., 

505  2013).  Some  of  these  differences  in  interaction  frequencies  were  senescence-  rather  than  disease- 

506  related  as  they  were  also  detected  in  normal  senescent  fibroblasts,  however,  differences  between 

507  senescent  normal  and  HGPS  cells  were  detected  (Chandra  et  al.,  2015).  It  remains  unclear  how 

508  disease  related  these  differences  are,  as  it  could  be  more  a  reflection  of  the  type  of  senescence  being 

509  compared.  The  HGPS  cells  used  had  reached  (early)  replicative  senescence  by  continued  culture 

510  (M  cCord  et  al.,  2013),  whereas  senescence  was  induced  in  the  normal  cells  by  oncogene  activation, 

511  which,  unlike  replicative  senescence,  forms  senescence-associated  heterochromatin  foci  (Chandra  et 

512  al.,  2015).  The  laminopathy- related  spatial  positioning  patterns  are  tissue-specific,  and  chromosomal 

513  positioning  abnormalities  have  not  been  detected  in  laminopathy  patient  lymphoblasts,  a  cel  I -type 

514  that  does  not  require  A-type  lamins  (Boyle  et  al.,  2001;M  eaburn  etal.,  2005).  Further,  in  a  C.  e/egans 

515  model  of  Emery-Dreifuss  muscular  dystrophy,  a  muscle-specific  promotor  was  retained  at  the  NE  in 

516  the  muscles  of  diseased  animals,  however,  the  release  from  the  NE  of  a  gut-specific  promotor  was 

517  not  affected  in  gut  tissue  (M  attout  etal.,  2011). 

518 

519  In  benign  prostate  hyperplasia  tissues  MM P2  shifts  to  a  more  peripheral  position  than  normal  prostate 

520  tissue,  whereas  the  positions  of  MMP9  and  FLU  are  unaffected  (Figure  2)  (Leshner  et  al.,  2016).  In 

521  lymphocytes  from  Down  syndrome  patients,  two  (or  three)  copies  of  chromosome  21  clusters 

522  together  more  frequently  than  in  diploid  control  cells  (Figure  2)  (Paz  et  al.,  2013;Paz  et  al.,  2015). 

523  Moreover,  SOD1,  which  is  located  on  chromosome  21,  is  more  peripherally  located  in  Down 

524  syndrome  (Paz  et  al.,  2015).  This  altered  position  is  not  simply  a  consequence  of  the  presence  of  an 

525  additional  chromosome  21,  however,  as  the  radial  position  of  DYRK1A,  which  also  maps  to 
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526  chromosome  21,  is  unaffected  (Paz  et  al 2015).  It  is  not  only  chronic  diseases  that  are  associated 

527  with  a  spatial  reorganization  of  the  genome.  Short-term  repositioning  of  some,  but  not  all,  genes  or 

528  chromosomes  occurs  after  viral  or  parasitic  infection  (Li  et  al.,  2010;K night  et  al.,  2011;A rican- 

529  Goktas  et  al.,  2014).  Chromosome  17,  but  not  18,  relocates  to  a  more  peripheral  position  in  human 

530  lymphocytes  within  24  hours  of  Epstein-Barr  virus  infection  (Li  et  al.,  2010).  Similarly,  dynamic 

531  radial  repositioning  of  actin,  ferritin  and  hsp70,  occurs  in  both  cultured  snail  cells  and  in  intact  live 

532  snails  within  hours  of  active  parasitic  infection,  with  the  repositioning  correlating  to  changes  in  gene 

533  expression  (K  night  et  al .,  2011;A  rican-G  oktas  et  al .,  2014). 

534 

535  Local  disruptions  to  hierarchical  genome  organization  can  be  important  for  disease  progression. 

536  High-resolution  mapping  of  the  3D  genome  has  revealed  that  beyond  a  spatial  separation  of  gene-rich 

537  and  gene-poor  chromatin,  chromosomes  further  fold  into  a  succession  of  ten  to  hundreds  of  kb  long 

538  self- associating  globular  domains,  known  as  topologically  associating  domains  (TADs)  (Dixon  et  al., 

539  2012;Nora  et  al.,  2012;Sexton  et  al.,  2012).  Individual  TADs  are  stably  conserved  between  species 

540  and  cell  types  and  are  separated  by  sharp  boundaries  that  are  enriched  in  histone  marks  associated 

541  with  active  genes,  SINEs,  binding  sites  for  the  chromatin-binding  protein  CTCF  and  housekeeping 

542  genes  (Dixon  et  al.,  2012).  TAD  boundaries  are  believed  to  be  important  for  gene  regulation,  by 

543  acting  as  insulators,  inhibiting  the  expansion  of  either  heterochromatin  or  active  chromatin  into  a 

544  neighboring  TADs  and  by  increasing  the  frequency  of  specific  regulatory  interactions  by  confining 

545  certain  promoters  and  enhances  into  a  singleTAD  (Dixon  et  al .,  2012;Sexton  et  al.,  2012;Narendra  et 

546  al.,  2015).  Consentient  with  this,  genes  within  TAD  domains  are  co-expressed  during  differentiation 

547  and  disruption  of  TAD  boundaries  results  in  a  partial  or  complete  fusion  of  neighboring  TADs  and 

548  altered  gene  expression  (Nora  et  al.,  2012; L upianez  et  al.,  2015;Narendra  et  al.,  2015;H nisz  et  al., 

549  2016).  Disruption  to  the  TAD  boundary  near  the  EPHA4  locus  by  deletion,  duplication  or  inversion 

550  of  DNA  sequences  at  the  boundary  results  in  limb  deformation  syndromes  (Lupianez  et  al .,  2015).  In 

551  normal  limb  development  EPHA4  is  active  but  genes  in  neighboring  TADs  are  silent.  The  loss  of  the 

552  TAD  boundary  allows  enhancer-promoter  interactions  between  regulatory  sequences  normally 

553  separated  into  different  TADs,  and  consequently  results  in  the  spuriously  activation  of  the 

554  neighboring  genes  in  the  mis-fused  TAD  (Lupianez  et  al.,  2015).  Similarly,  cancer-associated  micro- 

555  deletions  or  mutations  at  TAD  boundaries  can  activate  oncogenes,  and  may  represent  a  common 

556  mechanism  of  gene  mis-regulation  in  multiple  cancers  (Hnisz  et  al .,  2016).  The  study  of  TADs  is  still 

557  in  its  infancy,  and  most  likely  disruption  to  TAD  boundaries  will  be  detected  as  functionally 

558  important  in  many  other  diseases. 

559 

560  Spatial  genome  organization  in  cancer 

561 

562  Despite  the  high  level  of  genomic  instability  associated  with  many  cancers,  which  could  conceivably 

563  be  expected  to  result  in  a  large  scale  disruption  to  the  spatial  organization  of  the  genome,  there  is  a 

564  general  conservation  of  spatial  positioning  patterns  in  cancer,  with  the  majority  of  genes  and 

565  chromosomes  not  altering  nuclear  position  (Kozubek  et  al.,  2002;Parada  et  al.,  2002;Cremer  et  al., 

566  2003;M  eaburn  and  M  isteli,  2008;M  eaburn  et  al.,  2009;Timme  et  al.,  2 0 1 1 ; Z ei tz  et  al.,  2013;Leshner 

567  et  al.,  2016;M  eaburn  et  al.,  2016b).  For  example,  44  of  47  (94%)  genes  spatially  mapped  in  human 

568  prostate  tissues  do  not  alter  radial  position  in  prostate  cancer  and  the  radial  positions  of  13  out  of  23 

569  (57%)  genes  are  conserved  in  breast  cancer  (M  eaburn  et  al.,  2009;Leshner  et  al.,  2016;M  eaburn  et  al., 

570  2016b).  While  genome  positioning  patterns  remain  largely  conserved  in  cancer,  some  genes  and 

571  chromosomes  occupy  distinct  nuclear  positions  in  cancerous  cells  compared  to  their  normal  or 

572  benign  counterparts,  with  the  patterns  of  reorganization  differing  between  different  cancers  (Figure  2) 

573  (Cremer  et  al.,  2003;M  urata  et  al.,  2007;M  eaburn  and  M  isteli,  2008;M  eaburn  et  al.,  2009;Wiech  et 
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574  al 2009;Zeitz  et  al 2013;Leshner  et  al 2016;M  eaburn  et  al 2016b).  For  instance,  the  radial 

575  positions  of  chromosomes  18  and  19  are  affected  in  many  cancer  types,  including  thyroid,  colon  and 

576  cervical  cancer,  with  a  tendency  for  chromosome  18  to  become  more  internally  located  and 

577  chromosome  19  to  shift  away  from  the  center  of  the  nucleus  (Cremer  et  al.,  2003;Murata  et  al., 

578  2007;Wiech  et  al.,  2009).  However,  in  some  cervical  cancers,  chromosome  18  becomes  more 

579  internally  located  (Wiech  et  al.,  2009).  In  breast  cancer  cell  lines  there  is  a  significant  change  to  the 

580  set  of  genes  that  are  close  spatial  neighbors  of  IGFBP3  (Zeitz  et  al.,  2013).  Moreover,  ten  genes 

581  (FLU,  HES5,  MMP9,  HSP90AA1,  TGFB3,  MYC,  ERBB2,  FOSL2,  CSF1R  and  AKT1)  have  been 

582  identified  to  radially  reposition  in  epithelial  cells  of  breast  cancer  tissues  (M eaburn  et  al., 

583  2009;M eaburn  et  al.,  2016b)  and  FLU,  MMP9  and  MMP2  occupy  alternative  radial  locations  in 

584  prostate  cancer  (Leshner  et  al .,  2016). 

585 

586  Interestingly,  of  the  eleven  genes  so  far  reported  with  altered  radial  position  in  either  breast  or 

587  prostate  cancer,  only  M M P9  and  FLU  robustly  reposition  in  both  types  of  cancer  (Figure  2)  (Leshner 

588  et  al.,  2016;M  eaburn  et  al.,  2016b).  This  suggests  cancer-type  specific  repositioning,  with  different 

589  sets  of  genes  repositioning  in  breast  and  prostate  cancer.  Similarly,  BCL2  repositions  in  a  BCL2- 

590  positive  cervical  squamous  carcinoma,  but  not  in  a  BCL2-negative  cervical  squamous  carcinoma 

591  (Wiech  et  al.,  2009),  nor  in  breast  (M  eaburn  et  al.,  2009)  or  prostate  cancer  (Leshner  et  al.,  2016). 

592  When  taken  together  with  the  findings  discussed  above  that  a  genomic  region  can  behave  differently, 

593  with  respect  to  repositioning  between  different  non-cancerous  diseases,  it  suggests  that  the  overall 

594  organization  of  the  genome  in  disease  is,  mostly,  specific  to  the  disease  and  not  a  consequence  of  a 

595  more  general  "stress”  response.  In  keeping  with  this,  of  the  genes  that  occupy  distinct  radial  positions 

596  in  breast  and/or  prostate  cancer,  only  MMP2  is  also  commonly  repositioned  in  non-malignant 

597  disease;  but  even  for  this  gene  it  cannot  be  a  general  disease  response  as  MMP2  repositions  in 

598  prostate  but  not  breast  cancer  (Figure  2)  (Leshner  et  al.,  2016;M  eaburn  et  al .,  2016b). 

599 

600  It  is  interesting  to  note  that  many  of  the  factors  implicated  in  influencing  gene  positioning  patterns 

601  are  often  altered  in  cancer,  e.g  lamins,  gene  expression,  proliferation  status,  chromatin  status  and 

602  histone  modifications.  However,  the  precise  mechanisms  involved  in  the  mis-organization  of  the 

603  genome  in  cancer  are  unclear.  It  would  seem  that,  for  the  radially  repositioned  genes  at  least, 

604  transcription  of  the  locus  itself  is  not  the  key  to  the  repositioning.  No  correlation  was  found  for  the 

605  repositioning  of  a  given  gene  and  its  transcriptional  output  in  a  model  of  early  cancer  (M  eaburn  and 

606  Misteli,  2008).  Moreover,  cancer-associated  genes,  that  are  known  to  have  altered  expression  in 

607  cancer,  often  did  not  reposition  in  breast  or  prostate  cancer  (M  eaburn  and  M  isteli,  2008;M  eaburn  et 

608  al.,  2009;Leshner  et  al.,  2016).  Even  for  the  cancer-associated  genes  that  do  reposition  in  cancer,  it 

609  may  not  be  altered  transcription  of  the  loci  that  is  driving  the  alternative  positioning  pattern.  For 

610  example,  although  elevated  expression  of  MMP9  and  MMP2  is  a  marker  of  poor  prognosis  in 

611  prostate  cancer,  and  linked  to  metastasis,  their  expression  is  from  tumor-associated  stroma  cells  and 

612  not  from  the  cancerous  epithelial  cells,  where  the  gene  repositioning  was  detected  (Egeblad  and 

613  Werb,  2002;Leshner  et  al.,  2016).  This  does  not  rule  out  gene  expression  as  a  factor  in  repositioning, 

614  and  further  studies  are  required  that  measure  gene  expression  changes  in  extended  chromosomal 

615  regions  of  the  repositioned  gene. 

616 

617  An  increase  in  proliferation  is  a  hallmark  of  cancer.  The  spatial  reorganization  of  the  genome  in 

618  cancer  does  not  seem  to  simply  be  a  reflection  of  a  transition  from  quiescent  normal  breast  or  prostate 

619  tissue  to  a  proliferating  tumor.  Spatial  genome  organization  in  cancer  cells  do  not  mimic  proliferating 

620  (or  quiescent)  breast  epithelial  cells  (M  eaburn  and  M  isteli,  2008).  This  is  not  surprising,  given  the 

621  fact  that  the  majority  of  cells  within  a  cancerous  tumor  are  actually  not  proliferating.  Ki-67  is 
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622  commonly  used  as  a  marker  of  cellular  proliferation,  since  it  is  present  in  all  stages  of  the  cell  cycle 

623  with  the  exception  of  Go.  Breast  cancers  that  contain  e30%  Ki-67  positive  cells  are  classified  as 

624  highly  proliferative  cancers,  and  the  majority  of  breast  cancers,  almost  60%,  have  only  10%  or  less  of 

625  cells  positive  for  Ki-67  (de  Azambuja  et  al .,  2007;J onat  and  Arnold,  2011).  In  fact,  no  feature  of  a 

626  genomic  locus  has  yet  been  identified  that  enables  the  prediction  of  whether  a  gene  will  reposition  or 

627  not  in  cancer  (M  eaburn  et  al.,  2009;Leshner  et  al.,  2016;M  eaburn  et  al.,  2016b). 

628 

629  Clinical  biomarker  potential 

630 

631  Regardless  of  why  the  genome  is  spatially  reorganized  in  the  nuclei  of  cancer  cells,  the  small  scale 

632  studies  so  far  performed  suggest  that  these  differences  in  positioning  patterns  have  the  potential  to  be 

633  exploited  for  diagnostic  use  (M  eaburn  et  al.,  2009;Leshner  et  al.,  2016;M  eaburn  et  al.,  2016b). 

634  Distinguishing  cancer  from  normal  tissue  by  spatial  gene  positioning  patterns  could  practically  be 

635  integrated  into  clinical  use  because  it  requires  only  a  small  tissue  sample  for  analysis  (100-1000 

636  cells),  meaning  it  will  not  require  additional  invasive  procedures  for  the  patient.  M  oreover,  FISH  is 

637  currently  already  used  in  diagnostic  labs,  thus,  the  infrastructure  for  such  analysis  already  exists  in 

638  the  clinic.  Importantly  for  clinical  applications,  for  most  genes  tested  in  breast  and  prostate  tissues 

639  false  positive  rates  were  low  since  for  the  vast  majority  of  genes  spatial  positioning  patterns  are 

640  highly  similar  between  individuals  in  non-diseased  tissue  (Borden  and  Manuelidis,  1988; W  iech  et  al ., 

641  2005;M  urata  et  al.,  2007;M  eaburn  et  al.,  2009;Leshner  et  al.,  2016;M  eaburn  et  al.,  2016b)  and 

642  repositioning  events  did  not  generally  occur  in  non-malignant  breast  and  prostate  disease  (M  eaburn 

643  et  al.,  2009;Leshner  et  al.,  2016;M  eaburn  et  al.,  2016b).  There  were  a  few  notable  exceptions,  M M P2 

644  repositions  in  both  cancer  and  non-malignant  disease,  and  the  radial  position  of  a  few  genes,  such  as 

645  CCND1  and  TIMP3,  were  highly  variable  between  normal  prostate  tissues.  This  highlights  that  each 

646  candidate  marker  gene  must  be  rigorously  tested  to  eliminate  genes  with  high  false  positive  rates,  as 

647  is  standard  practice  for  any  marker  brought  to  the  clinic.  Critically,  particularly  when  genes  are  used 

648  in  combination,  spatial  positioning  patterns  can  detect  breast  or  prostate  cancer  with  high  accuracy 

649  (M  eaburn  et  al.,  2009;Leshner  et  al.,  2016;M  eaburn  et  al.,  2016b).  When  genes  are  used  as  markers 

650  singularly,  between  64-100%  of  samples  are  correctly  identified  as  cancer  based  on  radial  positioning 

651  patterns.  Multiplexing  genes  improved  the  accuracy  of  cancer  detection  for  genes  with  lower 

652  sensitivities,  reducing  false  negative  rates  (M  eaburn  et  al.,  2009;Leshner  et  al.,  2016;M  eaburn  et  al., 

653  2016b).  Repositioning  events  are  independent  of  variations  in  copy  number  and  even  in  the 

654  background  of  highly  variable,  and  in  some  cases  large  scale,  genomic  instability  between  cancer 

655  specimens,  it  is  possible  to  use  gene  positioning  to  accurately  detect  cancers  (M  eaburn  et  al., 

656  2009;Leshner  et  al.,  2016;M  eaburn  et  al.,  2016b). 

657 

658  Although  these  studies  represent  the  largest  analysis  of  genome  organization  in  cancerous  tissues 

659  performed  to  date,  the  eleven  genes  identified  as  potential  breast  or  prostate  cancer  biomarkers  were 

660  analyzed  in  a  low  number  of  individuals  (11-19  cancers,  6-10  normal  tissues  and  5  non-malignant 

661  disease  tissues)  (M eaburn  et  al.,  2009;Leshner  et  al.,  2016;M eaburn  et  al.,  2016b).  Larger  scale 

662  follow  up  studies  are  required  to  assess  the  sensitivity  and  specificity  of  these  markers,  to  determine 

663  if  they  are  strong  enough  for  clinical  use.  It  will  be  important  to  compare  gene-positioning 

664  biomarkers  to  existing  diagnostic  methodologies  and  to  establish  if  using  the  various  methods  in 

665  combination,  or  alone,  improves  diagnostic  accuracy.  Additionally,  the  original  studies  were  not 

666  designed  to  determine  if  the  spatial  organization  of  the  genome  in  cancer  has  prognostic  value.  Some 

667  of  the  genes  so  far  identified  are  unlikely  to  be  useful  for  prognostics.  For  example,  FLU  repositions 

668  in  100%  (10/10)  of  breast  cancers  and  92.9%  (13/14)  of  prostate  cancers  (Leshner  et  al., 

669  2016;M  eaburn  et  al.,  2016b),  rates  far  too  high  to  suggest  they  can  be  used  to  stratify  indolent  and 
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670  aggressive  cancers.  Future  screening  studies  are  required,  using  a  highly  defined  set  of  indolent  and 

671  aggressive  cancers.  Given  the  fact  that  the  position  of  a  gene  can  vary  between  cancers  derived  from 

672  the  same  organ,  and  between  cancers  from  different  organs  (Meaburn  et  al .,  2009;Wiech  et  al ., 

673  2009;Leshner  et  al.,  2016;Meaburn  et  al.,  2016b),  it  suggests  the  possibility  that  the  spatial 

674  organization  of  the  genome  can  be  used  to  sub-type  cancers  for  prognostic  purposes.  It  is  "simply"  a 

675  matter  of  finding  the  right  genes.  Since  spatial  reorganization  of  the  genome  has  been  identified  in 

676  multiple  human  diseases,  it  is  likely  that  the  same  principle  of  using  spatial  genome  positioning 

677  patterns  as  a  diagnostic  biomarker  could  be  applied  to  any  disease. 

678 

679  Summery 

680 

681  There  is  no  doubt  that  the  genome  is  non-randomly  organized  in  interphase  nuclei.  However,  a  deep 

682  understanding  of  the  functional  relevance  of  the  spatial  organization  of  the  genome  and  the  molecular 

683  mechanisms  required  to  correctly  position  genes  and  chromosomes  has  proved  more  elusive.  Until 

684  recently,  due  to  practical  constraints,  loci  could  only  realistically  be  studied  in  virtual  isolation.  Yet  it 

685  is  increasingly  emerging  that  there  is  likely  multiple  mechanisms  important  for  positioning  a  loci, 

686  and  that  the  mechanism  employed  depends  on  the  individual  gene  and  the  cellular  context.  With  the 

687  rapidly  reducing  cost  of  sequencing  and  the  advent  of  multiple  biochemical  methods  to  analyze 

688  genome  organization  on  a  larger  scale,  the  tools  are  now  in  place  to  study  the  spatial  organization  of 

689  chromatin  on  a  genome  wide  scale.  It  will  be  important  to  compare  these  positioning  maps  to  gene 

690  expression  data,  epigenetic  maps,  proteome  data,  non-coding  RNA,  replication  timings  etc,  in 

691  combination.  Moreover,  it  will  be  important  to  not  only  ask  what  is  changing  at  the  level  of  an 

692  individual  locus,  it  will  be  vital  to  consider  the  chromosomal  neighborhood  at  large.  The  new 

693  technologies  will  not  make  FISH  obsolete;  in  fact,  quite  the  opposite.  It  will  be  critical  to  study 

694  candidate  regions,  identified  by  population-based  biochemical  methods,  in  greater  detail  using  single 

695  cell  FISH  analysis.  For  example,  in  how  many  cells  is  a  given  positioning  patterns  found?  Do  both 

696  alleles  of  a  gene  behave  in  the  same  manner?  Are  positioning  patterns  found  in  the  population 

697  mutually  exclusive  at  the  single  cell  level?  From  here,  loci  or  nuclear  factors  can  then  be  manipulated 

698  to  determine  the  true  functional  relevance,  beyond  correlation. 

699 

700  The  genome  is  spatially  reorganized  in  disease.  Accumulating  evidence  suggests  that  these 

701  repositioning  patterns  can  be  exploited  for  diagnostic  use.  Currently,  there  is  no  way  to  predict  which 

702  genes  will  reposition  in  cancer.  As  we  gain  a  better  understanding  of  how  spatial  positioning  is 

703  regulated  in  a  normal  cell,  it  may  become  easier  to  predict  regions  that  will  be  diagnostically 

704  interesting.  The  reverse  is  also  true.  Comparing  positioning  patterns  between  normal  and  diseased 

705  cells  will  inform  our  understanding  of  spatial  genome  organization.  While  it  is  exciting  that  proof-of- 

706  principle  studies  have  identified  spatial  positioning  of  the  genome  as  a  potentially  sensitive  and 

707  specific  biomarker  of  cancer,  much  work  is  yet  to  be  done,  large-scale  follow-up  studies  are  required 

708  to  determine  the  feasibility  and  usefulness  of  spatial  genome  organization-based  diagnostic  in  the 

709  clinical  setting.  Moreover,  while  additional  diagnostic  tests  have  their  use,  there  is  urgent  need  for 

710  biomarkers  that  can  distinguish  aggressive  cancers  from  indolent  ones.  These  markers  would  have  a 

711  tremendous  impact  on  cancer  patients,  by  guiding  clinicians  to  the  best  treatment  regime  for  an 

712  individual  and  reducing  both  under  and  over-treatment.  As  such,  as  future  studies  relating  to  genome 

713  organization  for  clinical  use  should  focus  on  prognostic  rather  than  simply  diagnostic. 

714 

715 
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716  Abbreviation  list 

111 

718  FISH  =  Fluorescence  in  situ  hybridization 

719  ICF  =  Immunodeficiency,  Centromere  instability  and  Facial  anomalies  syndrome 

720  HGPS  =  H  utchinson-G ilford  progeria  syndrome 

721  NE  =  Nuclear  envelope 

722  N  PC  =  N  uclear  pore  complex 

723  PSA  =  Prostate-specific  antigen 

724  TAD  =  Topological ly  associating  domain 
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1365  Figure  legends 

1366 

1367  Figure  1:  Non-random  organization  of  the  genome.  The  spatial  organization  of  the  genome  is 

1368  characterized  in  several  ways:  (A)  Radial  positioning  describes  the  position  of  a  locus  relative  to  the 

1369  center  and  periphery  of  the  nucleus.  Gene-poor  chromosomes  (yellow)  generally  locate  to  the  nuclear 

1370  periphery,  whilst  gene-rich  chromosomes  have  a  preference  for  the  nuclear  interior.  (B-E)  Relative 

1371  positioning  describes  the  position  of  a  locus  relative  to  another  nuclear  landmark.  (B)The  position  of 

1372  a  locus  relative  to  another  locus.  The  yellow  and  orange  chromosomes  are  in  close  spatial  proximity, 

1373  whereas  the  orange  and  red  chromosomes  are  distal.  (C)  Association  with  the  nuclear  envelope  (NE). 

1374  Some  genes  co-localize  with  the  NE  at  high  frequency  (yellow  gene).  (D)  Association  with  a  nuclear 

1375  body  such  as  a  splicing  speckle.  (E)  The  position  of  gene  relative  its  own  chromosome  territory. 

1376  Some  gene  rich  loci  loop  out  of  the  bulk  of  the  chromosome  territory  when  highly  expressed. 

1377  Figure  2:  Loci-specific  reorganization  of  the  genome  in  disease.  Certain  loci  adopt  alternative 

1378  nuclear  positions  in  disease  (tick)  compared  to  normal  cells,  whilst  the  positions  of  other  loci  are 

1379  conserved  in  disease  (cross).  Moreover,  the  repositioning  of  some  loci  is  disease  specific,  and 

1380  although  a  gene  repositions  in  one  disease,  it  may  not  in  another  disease.  Blue  =  Nucleus;  NE  = 

1381  N  uclear  envelope;  Cent.  =  Centromere. 
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